Recommended Books and Resources


• L. Hand and J. Finch, Analytical Mechanics


This very readable book covers everything in the course at the right level. It is similar 
to Goldstein’s book in its approach but with clearer explanations, albeit at the expense 
of less content. 


There are also three classic texts on the subject:
• H. Goldstein, C. Poole and J. Safko, Classical Mechanics


In previous editions it was known simply as “Goldstein” and has been the canonical 
choice for generations of students. Although somewhat verbose, it is considered the 
standard reference on the subject. Goldstein died and the current, third, edition found 
two extra authors. 


• L. Landau and E. Lifshitz, Mechanics


This is a gorgeous, concise and elegant summary of the course in 150 content-packed
pages. Landau is one of the most important physicists of the 20th century and this is 
the first volume in a series of ten, considered by him to be the “theoretical minimum” 
amount of knowledge required to embark on research in physics. In 30 years, only 43 
people passed Landau’s exam! 


A little known fact: Landau originally co-authored this book with one of his students, 
Leonid Pyatigorsky. They subsequently had a falling out and the authorship was 
changed. There are rumours that Pyatigorsky got his own back by denouncing Landau 
to the Soviet authorities, resulting in his arrest. 


• V.I. Arnold, Mathematical Methods of Classical Mechanics


Arnold presents a more modern mathematical approach to the topics of this course, 
making connections with the differential geometry of manifolds and forms. It kicks off 
with “The Universe is an Affine Space” and proceeds from there... 
1. Newton’s Laws of Motion


“So few went to hear him, and fewer understood him, that oftimes he did, 
for want of hearers, read to the walls. He usually stayed about half an hour; 
when he had no auditors he commonly returned in a quarter of that time.” 


Appraisal of a Cambridge lecturer in classical mechanics, circa 1690 


1.1 Introduction 


The fundamental principles of classical mechanics were laid down by Galileo and Newton
in the 16th and 17th centuries. In 1686, Newton wrote the Principia where he
gave us three laws of motion, one law of gravity and pretended he didn’t know calculus.
Probably the single greatest scientific achievement in history, you might think
this pretty much wraps it up for classical mechanics. And, in a sense, it does. Given 
a collection of particles, acted upon by a collection of forces, you have to draw a nice 
diagram, with the particles as points and the forces as arrows. The forces are then 
added up and Newton’s famous “F = ma” is employed to figure out where the
particle’s velocities are heading next. All you need is enough patience and a big enough
computer and you’re done. 


From a modern perspective this is a little unsatisfactory on several levels: it’s messy 
and inelegant; it’s hard to deal with problems that involve extended objects rather than 
point particles; it obscures certain features of dynamics so that concepts such as chaos 
theory took over 200 years to discover; and it’s not at all clear what the relationship is 
between Newton’s classical laws and quantum physics. 


The purpose of this course is to resolve these issues by presenting new perspectives 
on Newton’s ideas. We shall describe the advances that took place during the 150 
years after Newton when the laws of motion were reformulated using more powerful 
techniques and ideas developed by some of the giants of mathematical physics: people 
such as Euler, Lagrange, Hamilton and Jacobi. This will give us an immediate practical 
advantage, allowing us to solve certain complicated problems with relative ease (the 
strange motion of spinning tops is a good example). But, perhaps more importantly, 
it will provide an elegant viewpoint from which we’ll see the profound basic principles 
which underlie Newton’s familiar laws of motion. We shall prise open “F = ma” to 
reveal the structures and symmetries that lie beneath. 


Moreover, the formalisms that we’ll develop here are the basis for all of fundamental 
modern physics. Every theory of Nature, from electromagnetism and general relativity, 
to the standard model of particle physics and more speculative pursuits such as string 
theory, is best described in the language we shall develop in this course. The new 
formalisms that we’ll see here also provide the bridge between the classical world and 
the quantum world. 


There are phenomena in Nature for which these formalisms are not particularly 
useful. Systems which are dissipative, for example, are not so well suited to these 
new techniques. But if you want to understand the dynamics of planets and stars and 
galaxies as they orbit and spin, or you want to understand what’s happening at the 
LHC where protons are collided at unprecedented energies, or you want to know how 
electrons meld together in solids to form new states of matter, then the foundations 
that we’ll lay in this course are a must.


1.2 Newtonian Mechanics: A Single Particle 


In the rest of this section, we’ll take a flying tour through the basic ideas of classical 
mechanics handed down to us by Newton. More details can be found in the lectures 
on Dynamics and Relativity. 


We’ll start with a single particle. A particle is defined to be an object of insignificant
size, e.g. an electron, a tennis ball or a planet. Obviously the validity of this statement
depends on the context: to first approximation, the earth can be treated as a particle 
when computing its orbit around the sun. But if you want to understand its spin, it 
must be treated as an extended object. 


The motion of a particle of mass m at the position r is governed by Newton’s Second 
Law F = ma or, more precisely, 


$$\mathbf{F}(\mathbf{r}, \dot{\mathbf{r}}) = \dot{\mathbf{p}} \tag{1.1}$$


where F is the force which, in general, can depend on both the position $\mathbf{r}$ and
the velocity $\dot{\mathbf{r}}$ (for example, friction forces depend on $\dot{\mathbf{r}}$), and $\mathbf{p} = m\dot{\mathbf{r}}$ is the momentum.
Both F and p are 3-vectors which we denote by the bold font. Equation (1.1) reduces
to F = ma if $\dot{m} = 0$. But if m = m(t) (e.g. in rocket science) then the form with $\dot{\mathbf{p}}$ is
correct.


General theorems governing differential equations guarantee that if we are given $\mathbf{r}$
and $\dot{\mathbf{r}}$ at an initial time $t = t_0$, we can integrate equation (1.1) to determine $\mathbf{r}(t)$ for all
t (as long as F remains finite). This is the goal of classical dynamics.
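
As a rough illustration of what "integrating equation (1.1)" means in practice, here is a minimal Python sketch that steps the equation forward from initial data using the classical fourth-order Runge-Kutta method. The function name `integrate_newton`, the choice of integrator and the uniform-gravity test force are assumptions made for this example; they are not part of the course material.

```python
def integrate_newton(force, m, r0, v0, t0, t1, steps):
    """Integrate m * d2r/dt2 = force(r, v, t) from t0 to t1 with classical RK4.

    r0 and v0 are the initial position and velocity (tuples of floats).
    Returns the final (position, velocity). The mass m is taken to be constant.
    """
    def deriv(r, v, t):
        # First-order form of Newton's law: dr/dt = v, dv/dt = F/m
        f = force(r, v, t)
        return v, tuple(fi / m for fi in f)

    def add(a, b, s):
        # a + s * b, component by component
        return tuple(ai + s * bi for ai, bi in zip(a, b))

    h = (t1 - t0) / steps
    r, v, t = tuple(r0), tuple(v0), t0
    for _ in range(steps):
        k1r, k1v = deriv(r, v, t)
        k2r, k2v = deriv(add(r, k1r, h / 2), add(v, k1v, h / 2), t + h / 2)
        k3r, k3v = deriv(add(r, k2r, h / 2), add(v, k2v, h / 2), t + h / 2)
        k4r, k4v = deriv(add(r, k3r, h), add(v, k3v, h), t + h)
        r = tuple(ri + h / 6 * (a + 2 * b + 2 * c + d)
                  for ri, a, b, c, d in zip(r, k1r, k2r, k3r, k4r))
        v = tuple(vi + h / 6 * (a + 2 * b + 2 * c + d)
                  for vi, a, b, c, d in zip(v, k1v, k2v, k3v, k4v))
        t += h
    return r, v


# Illustrative test case: uniform gravity, F = (0, 0, -m g),
# for which the exact answer is r0 + v0 t - g t^2 / 2 in the z direction.
m, g = 2.0, 9.81
r, v = integrate_newton(lambda pos, vel, t: (0.0, 0.0, -m * g),
                        m, (0.0, 0.0, 0.0), (1.0, 0.0, 5.0), 0.0, 1.0, 100)
```

For a constant force the scheme reproduces the parabolic trajectory up to rounding; for a general force the answer is only approximate and the step count would need to be chosen with care.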


Equation (1.1) is not quite correct as stated: we must add the caveat that it holds
only in an inertial frame. This is defined to be a frame in which a free particle with
$\dot{m} = 0$ travels in a straight line,


$$\mathbf{r} = \mathbf{r}_0 + \mathbf{v}t \tag{1.2}$$

Newton’s first law is the statement that such frames exist.


An inertial frame is not unique. In fact, there are an infinite number of inertial frames. 
Let S be an inertial frame. Then there are 10 linearly independent transformations
$S \to S'$ such that $S'$ is also an inertial frame (i.e. if (1.2) holds in S, then it also holds
in $S'$). These are


• 3 Rotations: $\mathbf{r}' = O\mathbf{r}$ where O is a 3 × 3 orthogonal matrix.
• 3 Translations: $\mathbf{r}' = \mathbf{r} + \mathbf{c}$ for a constant vector c.
• 3 Boosts: $\mathbf{r}' = \mathbf{r} + \mathbf{u}t$ for a constant velocity u.
• 1 Time Translation: $t' = t + c$ for a constant real number c.


If motion is uniform in S, it will also be uniform in S’. These transformations make 
up the Galilean Group under which Newton’s laws are invariant. They will be important
in section 2.4 where we will see that these symmetries of space and time are the
underlying reason for conservation laws. As a parenthetical remark, recall from special 
relativity that Einstein’s laws of motion are invariant under Lorentz transformations 
which, together with translations, make up the Poincaré group. We can recover the 
Galilean group from the Poincaré group by taking the speed of light to infinity. 


1.2.1 Angular Momentum 


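We define the angular momentum L of a particle and the torque $\boldsymbol{\tau}$ acting upon it as

$$\mathbf{L} = \mathbf{r} \times \mathbf{p}, \qquad \boldsymbol{\tau} = \mathbf{r} \times \mathbf{F} \tag{1.3}$$


Note that, unlike linear momentum p, both L and $\boldsymbol{\tau}$ depend on where we take the
origin: we measure angular momentum with respect to a particular point. Let us cross
both sides of equation (1.1) with r. Using the fact that $\dot{\mathbf{r}}$ is parallel to p, we can write
$\frac{d}{dt}(\mathbf{r} \times \mathbf{p}) = \mathbf{r} \times \dot{\mathbf{p}}$. Then we get a version of Newton’s second law that holds for angular
momentum:


$$\boldsymbol{\tau} = \dot{\mathbf{L}} \tag{1.4}$$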


1.2.2 Conservation Laws 


From (1.1) and (1.4), two important conservation laws follow immediately.
• If F = 0 then p is constant throughout the motion
• If $\boldsymbol{\tau} = 0$ then L is constant throughout the motion


Notice that $\boldsymbol{\tau} = 0$ does not require F = 0, but only $\mathbf{r} \times \mathbf{F} = 0$. This means that F
must be parallel to r. This is the definition of a central force. An example is given by
the gravitational force between the earth and the sun: the earth’s angular momentum 
about the sun is constant. As written above in terms of forces and torques, these 
conservation laws appear trivial. In section 2.4, we’ll see how they arise as a property 
of the symmetry of space as encoded in the Galilean group. 


1.2.3 Energy 
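Let’s now recall the definitions of energy. We firstly define the kinetic energy T as


$$T = \tfrac{1}{2} m\, \dot{\mathbf{r}} \cdot \dot{\mathbf{r}} \tag{1.5}$$

Suppose from now on that the mass is constant. We can compute the change of kinetic
energy with time: $\frac{dT}{dt} = \dot{\mathbf{p}} \cdot \dot{\mathbf{r}} = \mathbf{F} \cdot \dot{\mathbf{r}}$. If the particle travels from position $\mathbf{r}_1$ at time $t_1$
to position $\mathbf{r}_2$ at time $t_2$ then this change in kinetic energy is given by


$$T(t_2) - T(t_1) = \int_{t_1}^{t_2} \frac{dT}{dt}\, dt = \int_{t_1}^{t_2} \mathbf{F} \cdot \dot{\mathbf{r}}\, dt = \int_{\mathbf{r}_1}^{\mathbf{r}_2} \mathbf{F} \cdot d\mathbf{r} \tag{1.6}$$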


where the final expression involving the integral of the force over the path is called the 
work done by the force. So we see that the work done is equal to the change in kinetic 
energy. From now on we will mostly focus on a very special type of force known as a 
conservative force. Such a force depends only on position $\mathbf{r}$ rather than velocity $\dot{\mathbf{r}}$ and
is such that the work done is independent of the path taken. In particular, for a closed
path, the work done vanishes.


$$\oint \mathbf{F} \cdot d\mathbf{r} = 0 \quad \Leftrightarrow \quad \nabla \times \mathbf{F} = 0 \tag{1.7}$$


It is a deep property of flat space $\mathbb{R}^3$ that this property implies we may write the force
as


$$\mathbf{F} = -\nabla V(\mathbf{r}) \tag{1.8}$$


for some potential V(r). Systems which admit a potential of this form include gravitational,
electrostatic and interatomic forces. When we have a conservative force, we
necessarily have a conservation law for energy. To see this, return to equation (1.6)
which now reads


$$T(t_2) - T(t_1) = -\int_{\mathbf{r}_1}^{\mathbf{r}_2} \nabla V \cdot d\mathbf{r} = -V(t_2) + V(t_1) \tag{1.9}$$


or, rearranging things,

$$T(t_1) + V(t_1) = T(t_2) + V(t_2) = E \tag{1.10}$$


So E = T + V is also a constant of motion. It is the energy. When the energy is 
considered to be a function of position r and momentum p it is referred to as the 
Hamiltonian H. In section 4 we will be seeing much more of the Hamiltonian. 


1.2.4 Examples 


• Example 1: The Simple Harmonic Oscillator


This is a one-dimensional system with a force proportional to the distance x to the
origin: $F(x) = -kx$. This force arises from a potential $V = \tfrac{1}{2}kx^2$. Since $F \neq 0$,
momentum is not conserved (the object oscillates backwards and forwards) and, since
the system lives in only one dimension, angular momentum is not defined. But energy
$E = \tfrac{1}{2}m\dot{x}^2 + \tfrac{1}{2}kx^2$ is conserved.


• Example 2: The Damped Simple Harmonic Oscillator


We now include a friction term so that $F(x, \dot{x}) = -kx - \gamma\dot{x}$. Since F is not conservative,
energy is not conserved. This system loses energy until it comes to rest.


• Example 3: Particle Moving Under Gravity


Consider a particle of mass m moving in 3 dimensions under the gravitational pull of
a much larger particle of mass M. The force is $\mathbf{F} = -(GMm/r^2)\,\hat{\mathbf{r}}$ which arises from
the potential $V = -GMm/r$. Again, the linear momentum p of the smaller particle
is not conserved, but the force is both central and conservative, ensuring the particle’s
total energy E and the angular momentum L are conserved.
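
The statements about energy in Examples 1 and 2 can be checked numerically. The sketch below integrates $m\ddot{x} = -kx - \gamma\dot{x}$ with a Runge-Kutta scheme and records $E = \tfrac{1}{2}m\dot{x}^2 + \tfrac{1}{2}kx^2$ at every step. The function name, the parameter values and the integrator are illustrative choices for this sketch, not something taken from the notes.

```python
def oscillator_energy_history(m, k, gamma, x0, v0, h, steps):
    """Integrate m x'' = -k x - gamma x' with RK4 and return the list of
    energies E = m v^2 / 2 + k x^2 / 2, one entry per time step."""
    def acc(x, v):
        return (-k * x - gamma * v) / m

    x, v = x0, v0
    energies = [0.5 * m * v * v + 0.5 * k * x * x]
    for _ in range(steps):
        k1x, k1v = v, acc(x, v)
        k2x, k2v = v + 0.5 * h * k1v, acc(x + 0.5 * h * k1x, v + 0.5 * h * k1v)
        k3x, k3v = v + 0.5 * h * k2v, acc(x + 0.5 * h * k2x, v + 0.5 * h * k2v)
        k4x, k4v = v + h * k3v, acc(x + h * k3x, v + h * k3v)
        x += h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        energies.append(0.5 * m * v * v + 0.5 * k * x * x)
    return energies


# gamma = 0 is Example 1 (conservative); gamma > 0 is Example 2 (friction).
undamped = oscillator_energy_history(1.0, 4.0, 0.0, 1.0, 0.0, 1e-3, 5000)
damped = oscillator_energy_history(1.0, 4.0, 0.5, 1.0, 0.0, 1e-3, 5000)
```

With these values the undamped energy stays at its initial value to within rounding, while the damped energy decreases steadily, in line with $dE/dt = -\gamma\dot{x}^2 \le 0$.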


1.3 Newtonian Mechanics: Many Particles 


It’s easy to generalise the above discussion to many particles: we simply add an index
to everything in sight! Let particle i have mass $m_i$ and position $\mathbf{r}_i$ where i = 1, ..., N
and N is the number of particles. Newton’s law now reads

$$\mathbf{F}_i = \dot{\mathbf{p}}_i \tag{1.11}$$

where $\mathbf{F}_i$ is the force on the $i^{\text{th}}$ particle. The subtlety is that forces can now be working
between particles. In general, we can decompose the force in the following way:


$$\mathbf{F}_i = \sum_{j \neq i} \mathbf{F}_{ij} + \mathbf{F}_i^{\text{ext}} \tag{1.12}$$


where $\mathbf{F}_{ij}$ is the force acting on the $i^{\text{th}}$ particle due to the $j^{\text{th}}$ particle, while $\mathbf{F}_i^{\text{ext}}$ is the
external force on the $i^{\text{th}}$ particle. We now sum over all N particles


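$$\begin{aligned} \sum_i \mathbf{F}_i &= \sum_{i,j \text{ with } j \neq i} \mathbf{F}_{ij} + \sum_i \mathbf{F}_i^{\text{ext}} \\ &= \sum_{i<j} (\mathbf{F}_{ij} + \mathbf{F}_{ji}) + \sum_i \mathbf{F}_i^{\text{ext}} \end{aligned} \tag{1.13}$$


where, in the second line, we’ve re-written the sum to be over all pairs i < j. At this
stage we make use of Newton’s third law of motion: every action has an equal and
opposite reaction. Or, in other words, $\mathbf{F}_{ij} = -\mathbf{F}_{ji}$. We see that the first term vanishes
and we are left simply with


$$\sum_i \mathbf{F}_i = \mathbf{F}^{\text{ext}} \tag{1.14}$$


where we’ve defined the total external force to be $\mathbf{F}^{\text{ext}} = \sum_i \mathbf{F}_i^{\text{ext}}$. We now define the
total mass of the system $M = \sum_i m_i$ as well as the centre of mass R

$$\mathbf{R} = \frac{\sum_i m_i \mathbf{r}_i}{M} \tag{1.15}$$


Then using (1.11), and summing over all particles, we arrive at the simple formula,

$$\mathbf{F}^{\text{ext}} = M\ddot{\mathbf{R}} \tag{1.16}$$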


which is identical to that of a single particle. This is an important formula. It tells us that
the centre of mass of a system of particles acts just as if all the mass were concentrated
there. In other words, it doesn’t matter if you throw a tennis ball or a very lively cat:
the centre of mass of each traces the same path.


1.3.1 Momentum Revisited 


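The total momentum is defined to be $\mathbf{P} = \sum_i \mathbf{p}_i$ and, from the formulae above, it is
simple to derive $\dot{\mathbf{P}} = \mathbf{F}^{\text{ext}}$. So we find the conservation law of total linear momentum
for a system of many particles: P is constant if $\mathbf{F}^{\text{ext}}$ vanishes.


Similarly, we define total angular momentum to be $\mathbf{L} = \sum_i \mathbf{L}_i$. Now let’s see what
happens when we compute the time derivative.


$$\dot{\mathbf{L}} = \sum_i \mathbf{r}_i \times \dot{\mathbf{p}}_i = \sum_i \mathbf{r}_i \times \Big( \sum_{j \neq i} \mathbf{F}_{ij} + \mathbf{F}_i^{\text{ext}} \Big) \tag{1.17}$$

$$\phantom{\dot{\mathbf{L}}} = \sum_{i,j \text{ with } i \neq j} \mathbf{r}_i \times \mathbf{F}_{ij} + \sum_i \mathbf{r}_i \times \mathbf{F}_i^{\text{ext}} \tag{1.18}$$


The last term in this expression is the definition of total external torque: $\boldsymbol{\tau}^{\text{ext}} = \sum_i \mathbf{r}_i \times \mathbf{F}_i^{\text{ext}}$.
But what are we going to do with the first term on the right hand side? Ideally we
would like it to vanish! Let’s look at the circumstances under which this will happen.
We can again rewrite it as a sum over pairs i < j to get


$$\sum_{i<j} (\mathbf{r}_i - \mathbf{r}_j) \times \mathbf{F}_{ij} \tag{1.19}$$


which will vanish if and only if the force $\mathbf{F}_{ij}$ is parallel to the line joining the two particles
$(\mathbf{r}_i - \mathbf{r}_j)$. This is the strong form of Newton’s third law. If this is true, then we have a
statement about the conservation of total angular momentum, namely L is constant if
$\boldsymbol{\tau}^{\text{ext}} = 0$.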


Most forces do indeed obey both forms of Newton’s third law:
$\mathbf{F}_{ij} = -\mathbf{F}_{ji}$ and $\mathbf{F}_{ij}$ is parallel to $(\mathbf{r}_i - \mathbf{r}_j)$. For example, gravitational
and electrostatic forces have this property. And the total momentum
and angular momentum are both conserved in these systems. But
some forces don’t have these properties! The most famous example
is the Lorentz force on two moving particles with electric charge Q.
This is given by,


$$\mathbf{F}_{ij} = Q\, \mathbf{v}_i \times \mathbf{B}_j \tag{1.20}$$


where $\mathbf{v}_i$ is the velocity of the $i^{\text{th}}$ particle and $\mathbf{B}_j$ is the magnetic
field generated by the $j^{\text{th}}$ particle. Consider two particles crossing
each other in a “T” as shown in the diagram. The force on particle
1 from particle 2 vanishes. Meanwhile, the force on particle 2 from
particle 1 is non-zero, and in the direction


$$\mathbf{F}_{21} \sim \mathbf{v}_2 \times \mathbf{B}_1 \tag{1.21}$$


Figure 1: The magnetic field for two particles.


Does this mean that conservation of total linear and angular momentum is violated? 
Thankfully, no! We need to realise that the electromagnetic field itself carries angular 
momentum which restores the conservation law. Once we realise this, it becomes a 
rather cheap counterexample to Newton’s third law, little different from an underwater 
swimmer who can appear to violate Newton’s third law if we don’t take into account 
the momentum of the water. 


1.3.2 Energy Revisited 


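The total kinetic energy of a system of many particles is $T = \tfrac{1}{2}\sum_i m_i \dot{\mathbf{r}}_i^2$. Let us
decompose the position vector $\mathbf{r}_i$ as

$$\mathbf{r}_i = \mathbf{R} + \tilde{\mathbf{r}}_i \tag{1.22}$$


where $\tilde{\mathbf{r}}_i$ is the position of particle i relative to the centre of mass. Then we can write
the total kinetic energy as


$$T = \tfrac{1}{2} M \dot{\mathbf{R}}^2 + \tfrac{1}{2} \sum_i m_i \dot{\tilde{\mathbf{r}}}_i^2 \tag{1.23}$$


which shows us that the kinetic energy splits up into the kinetic energy of the centre
of mass, together with an internal energy describing how the system is moving around
its centre of mass. As for a single particle, we may calculate the change in the total
kinetic energy,


$$T(t_2) - T(t_1) = \sum_i \int \mathbf{F}_i^{\text{ext}} \cdot d\mathbf{r}_i + \sum_{i \neq j} \int \mathbf{F}_{ij} \cdot d\mathbf{r}_i \tag{1.24}$$
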
Like before, we need to consider conservative forces to get energy conservation. But
now we need both


• Conservative external forces: $\mathbf{F}_i^{\text{ext}} = -\nabla_i V_i(\mathbf{r}_1, \ldots, \mathbf{r}_N)$


• Conservative internal forces: $\mathbf{F}_{ij} = -\nabla_i V_{ij}(\mathbf{r}_1, \ldots, \mathbf{r}_N)$


where $\nabla_i \equiv \partial/\partial \mathbf{r}_i$. To get Newton’s third law $\mathbf{F}_{ij} = -\mathbf{F}_{ji}$ together with the requirement
that this is parallel to $(\mathbf{r}_i - \mathbf{r}_j)$, we should take the internal potentials to satisfy $V_{ij} = V_{ji}$
with


$$V_{ij}(\mathbf{r}_1, \ldots, \mathbf{r}_N) = V_{ij}(|\mathbf{r}_i - \mathbf{r}_j|) \tag{1.25}$$


so that $V_{ij}$ depends only on the distance between the $i^{\text{th}}$ and $j^{\text{th}}$ particles. We also
insist on a restriction for the external forces, $V_i(\mathbf{r}_1, \ldots, \mathbf{r}_N) = V_i(\mathbf{r}_i)$, so that the force
on particle i does not depend on the positions of the other particles. Then, following
the steps we took in the single particle case, we can define the total potential energy
$V = \sum_i V_i + \sum_{i<j} V_{ij}$ and we can show that H = T + V is conserved.


1.3.3 An Example 

Let us return to the case of gravitational attraction between two bodies but, unlike
in Section 1.2.4, now including both particles. We have $T = \tfrac{1}{2} m_1 \dot{\mathbf{r}}_1^2 + \tfrac{1}{2} m_2 \dot{\mathbf{r}}_2^2$. The
potential is $V = -G m_1 m_2 / |\mathbf{r}_1 - \mathbf{r}_2|$. This system has total linear momentum and total
angular momentum conserved, as well as the total energy H = T + V.
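
The three conservation laws for the two-body system can be checked with a short numerical experiment. The sketch below restricts the motion to a plane, sets G = 1 and uses a velocity-Verlet ("kick-drift-kick") integrator. All function names, masses and initial conditions are assumptions chosen for illustration.

```python
import math


def accelerations(r1, r2, m1, m2, G=1.0):
    """Accelerations from F_12 = -G m1 m2 (r1 - r2) / |r1 - r2|^3 and F_21 = -F_12."""
    dx, dy = r1[0] - r2[0], r1[1] - r2[1]
    d3 = (dx * dx + dy * dy) ** 1.5
    a1 = (-G * m2 * dx / d3, -G * m2 * dy / d3)
    a2 = (G * m1 * dx / d3, G * m1 * dy / d3)
    return a1, a2


def conserved(r1, v1, r2, v2, m1, m2, G=1.0):
    """Total momentum P, total angular momentum L_z and energy H = T + V."""
    P = (m1 * v1[0] + m2 * v2[0], m1 * v1[1] + m2 * v2[1])
    Lz = (m1 * (r1[0] * v1[1] - r1[1] * v1[0])
          + m2 * (r2[0] * v2[1] - r2[1] * v2[0]))
    T = 0.5 * m1 * (v1[0] ** 2 + v1[1] ** 2) + 0.5 * m2 * (v2[0] ** 2 + v2[1] ** 2)
    V = -G * m1 * m2 / math.hypot(r1[0] - r2[0], r1[1] - r2[1])
    return P, Lz, T + V


def evolve(r1, v1, r2, v2, m1, m2, h=5e-4, steps=10000):
    """Velocity-Verlet time stepping: half kick, drift, half kick."""
    a1, a2 = accelerations(r1, r2, m1, m2)
    for _ in range(steps):
        v1 = (v1[0] + 0.5 * h * a1[0], v1[1] + 0.5 * h * a1[1])
        v2 = (v2[0] + 0.5 * h * a2[0], v2[1] + 0.5 * h * a2[1])
        r1 = (r1[0] + h * v1[0], r1[1] + h * v1[1])
        r2 = (r2[0] + h * v2[0], r2[1] + h * v2[1])
        a1, a2 = accelerations(r1, r2, m1, m2)
        v1 = (v1[0] + 0.5 * h * a1[0], v1[1] + 0.5 * h * a1[1])
        v2 = (v2[0] + 0.5 * h * a2[0], v2[1] + 0.5 * h * a2[1])
    return r1, v1, r2, v2


m1, m2 = 1.0, 3.0
start = ((1.0, 0.0), (0.0, 0.9), (-0.5, 0.0), (0.1, -0.2))
end = evolve(*start, m1, m2)
P0, L0, H0 = conserved(*start, m1, m2)
P1, L1, H1 = conserved(*end, m1, m2)
```

Because the internal forces are equal, opposite and directed along the line joining the particles, this scheme preserves P and L up to rounding error. The energy is preserved only approximately, with an error that shrinks as the step h is reduced.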


2. The Lagrangian Formalism 


When I was in high school, my physics teacher called me down one day after
class and said, “You look bored, I want to tell you something interesting”.
Then he told me something I have always found fascinating. Every time
the subject comes up I work on it.

Richard Feynman


Feynman’s teacher told him about the “Principle of Least Action”, one of the most 
profound results in physics. 


2.1 The Principle of Least Action 


Firstly, let’s get our notation right. Part of the power of the Lagrangian formulation 
over the Newtonian approach is that it does away with vectors in favour of more general 
coordinates. We start by doing this trivially. Let’s rewrite the positions of N particles 
with coordinates $\mathbf{r}_i$ as $x^A$ where A = 1, ..., 3N. Then Newton’s equations read


$$\dot{p}_A = -\frac{\partial V}{\partial x^A} \tag{2.1}$$


where $p_A = m_A \dot{x}^A$. The number of degrees of freedom of the system is said to be 3N.
These parameterise a 3N-dimensional space known as the configuration space C. Each 
point in C specifies a configuration of the system (i.e. the positions of all N particles). 
Time evolution gives rise to a curve in C. 


Figure 2: The path of particles in real space (on the left) and in configuration space (on the 
right). 


The Lagrangian 


Define the Lagrangian to be a function of the positions $x^A$ and the velocities $\dot{x}^A$ of all
the particles, given by


$$L(x^A, \dot{x}^A) = T(\dot{x}^A) - V(x^A) \tag{2.2}$$




where $T = \tfrac{1}{2}\sum_A m_A (\dot{x}^A)^2$ is the kinetic energy, and $V(x^A)$ is the potential energy.
Note the minus sign between T and V! To describe the principle of least action, we
consider all smooth paths $x^A(t)$ in C with fixed end points so that

$$x^A(t_i) = x^A_{\text{initial}} \quad \text{and} \quad x^A(t_f) = x^A_{\text{final}} \tag{2.3}$$

[Figure 3: paths x(t) between x_initial and x_final]

Of all these possible paths, only one is the true path
taken by the system. Which one? To each path, let us
assign a number called the action S defined as


$$S[x^A(t)] = \int_{t_i}^{t_f} L(x^A(t), \dot{x}^A(t))\, dt \tag{2.4}$$


The action is a functional (i.e. a function of the path which
is itself a function). The principle of least action is the following result:

Theorem (Principle of Least Action): The actual path taken by the system is an 
extremum of S. 


Proof: Consider varying a given path slightly, so 
$$x^A(t) \to x^A(t) + \delta x^A(t) \tag{2.5}$$


where we fix the end points of the path by demanding $\delta x^A(t_i) = \delta x^A(t_f) = 0$. Then
the change in the action is 


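$$\begin{aligned} \delta S &= \delta \left[ \int_{t_i}^{t_f} L\, dt \right] \\ &= \int_{t_i}^{t_f} \delta L\, dt \\ &= \int_{t_i}^{t_f} \left( \frac{\partial L}{\partial x^A}\, \delta x^A + \frac{\partial L}{\partial \dot{x}^A}\, \delta \dot{x}^A \right) dt \end{aligned} \tag{2.6}$$

At this point we integrate the second term by parts to get

$$\delta S = \int_{t_i}^{t_f} \left( \frac{\partial L}{\partial x^A} - \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{x}^A} \right) \right) \delta x^A\, dt + \left[ \frac{\partial L}{\partial \dot{x}^A}\, \delta x^A \right]_{t_i}^{t_f} \tag{2.7}$$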


But the final term vanishes since we have fixed the end points of the path so $\delta x^A(t_i) =
\delta x^A(t_f) = 0$. The requirement that the action is an extremum says that $\delta S = 0$ for all
changes in the path $\delta x^A(t)$. We see that this holds if and only if


$$\frac{\partial L}{\partial x^A} - \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{x}^A}\right) = 0 \quad \text{for each } A = 1, \ldots, 3N \tag{2.8}$$




These are known as Lagrange’s equations (or sometimes as the Euler-Lagrange equations).
To finish the proof, we need only show that Lagrange’s equations are equivalent
to Newton’s. From the definition of the Lagrangian (2.2), we have $\partial L/\partial x^A = -\partial V/\partial x^A$,
while $\partial L/\partial \dot{x}^A = p_A$. It’s then easy to see that equations (2.8) are indeed equivalent to
(2.1).
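
The theorem can be made concrete with a small numerical experiment. The sketch below discretises the action for a single particle in uniform gravity, with $L = \tfrac{1}{2}m\dot{x}^2 - mgx$, and compares the action of the true path with that of nearby paths sharing the same end points. The discretisation (finite-difference velocities, midpoint potential), the function names and the sinusoidal variation are assumptions made for this example.

```python
import math


def action(path, T, m=1.0, g=9.81):
    """Discretised action: sum of (kinetic - potential) * dt over each segment,
    for the Lagrangian L = m xdot^2 / 2 - m g x."""
    n = len(path) - 1
    dt = T / n
    S = 0.0
    for k in range(n):
        xdot = (path[k + 1] - path[k]) / dt
        xmid = 0.5 * (path[k] + path[k + 1])
        S += (0.5 * m * xdot ** 2 - m * g * xmid) * dt
    return S


def true_path(n, T, g=9.81):
    # Solution of xddot = -g with x(0) = x(T) = 0, sampled at n + 1 points
    return [0.5 * g * t * (T - t) for t in (T * k / n for k in range(n + 1))]


def varied_path(n, T, eps, g=9.81):
    # The variation eps * sin(pi t / T) vanishes at both end points
    base = true_path(n, T, g)
    return [x + eps * math.sin(math.pi * k / n) for k, x in enumerate(base)]


T, n = 1.0, 200
S0 = action(true_path(n, T), T)
S_plus = action(varied_path(n, T, 0.1), T)
S_minus = action(varied_path(n, T, -0.1), T)
```

Here `S_plus` and `S_minus` agree with each other to rounding error, so the change in S has no term linear in the variation, and both exceed `S0`. For this particular Lagrangian the true path is therefore a minimum of the discretised action.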


Some remarks on this important result: 


• This is an example of a variational principle which you already met in the eponymous
“variational principles” course.


• The principle of least action is a slight misnomer. The proof only requires that
$\delta S = 0$, and does not specify whether it is a maximum or minimum of S. Since
$L = T - V$, we can always increase S by taking a very fast, wiggly path with
$T \gg 0$, so the true path is never a maximum. However, it may be either a
minimum or a saddle point. So “Principle of stationary action” would be a more
accurate, but less catchy, name. It is sometimes called “Hamilton’s principle”.


• All the fundamental laws of physics can be written in terms of an action principle.
This includes electromagnetism, general relativity, the standard model of particle
physics, and attempts to go beyond the known laws of physics such as string
theory. For example, (nearly) everything we know about the universe is captured
in the Lagrangian


$$L = \sqrt{g}\left( R - \tfrac{1}{2} F_{\mu\nu}F^{\mu\nu} + \bar{\psi}\, \not{\!\!D}\, \psi \right) \tag{2.9}$$


where the terms carry the names of Einstein, Maxwell (or Yang and Mills) and
Dirac respectively, and describe gravity, the forces of nature (electromagnetism
and the nuclear forces) and the dynamics of particles like electrons and quarks.
If you want to understand what the terms in this equation really mean, then you
can find explanations in the lectures on General Relativity, Electromagnetism,
and Quantum Field Theory.


• There is a beautiful generalisation of the action principle to quantum mechanics
due to Feynman in which the particle takes all paths with some probability
determined by S. We will describe this in Section 4.8.


Back to classical mechanics, there are two very important reasons for working with 
Lagrange’s equations rather than Newton’s. The first is that Lagrange’s equations 
hold in any coordinate system, while Newton’s are restricted to an inertial frame. 
The second is the ease with which we can deal with constraints in the Lagrangian 
system. We’ll look at these two aspects in the next two subsections. 




2.2 Changing Coordinate Systems 


We shall now show that Lagrange’s equations hold in any coordinate system. In fact, 
this follows immediately from the action principle, which is a statement about paths 
and not about coordinates. But here we shall be a little more pedestrian in order to 
explain exactly what we mean by changing coordinates, and why it’s useful. Let 


$$q_a = q_a(x_1, \ldots, x_{3N}, t) \tag{2.10}$$


where we’ve included the possibility of using a coordinate system which changes with 
time t. Then, by the chain rule, we can write 


$$\dot{q}_a = \frac{dq_a}{dt} = \frac{\partial q_a}{\partial x^A}\, \dot{x}^A + \frac{\partial q_a}{\partial t} \tag{2.11}$$


In this equation, and for the rest of this course, we’re using the “summation convention”
in which repeated indices are summed over. Note also that we won’t be too careful
about whether indices are up or down - it won’t matter for the purposes of this course.
To be a good coordinate system, we should be able to invert the relationship so that
$x^A = x^A(q_a, t)$ which we can do as long as we have $\det(\partial x^A/\partial q_a) \neq 0$. Then we have,


$$\dot{x}^A = \frac{\partial x^A}{\partial q_a}\, \dot{q}_a + \frac{\partial x^A}{\partial t} \tag{2.12}$$


Now we can examine $L(x^A, \dot{x}^A)$ when we substitute in $x^A(q_a, t)$. Using (2.12) we have


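$$\frac{\partial L}{\partial q_a} = \frac{\partial L}{\partial x^A} \frac{\partial x^A}{\partial q_a} + \frac{\partial L}{\partial \dot{x}^A} \left( \frac{\partial^2 x^A}{\partial q_a \partial q_b}\, \dot{q}_b + \frac{\partial^2 x^A}{\partial t\, \partial q_a} \right) \tag{2.13}$$

while

$$\frac{\partial L}{\partial \dot{q}_a} = \frac{\partial L}{\partial \dot{x}^A} \frac{\partial \dot{x}^A}{\partial \dot{q}_a} \tag{2.14}$$


We now use the fact that we can “cancel the dots” and $\partial \dot{x}^A/\partial \dot{q}_a = \partial x^A/\partial q_a$ which we
can prove by substituting the expression for $\dot{x}^A$ into the LHS. Taking the time derivative
of (2.14) gives us


$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_a}\right) = \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{x}^A}\right) \frac{\partial x^A}{\partial q_a} + \frac{\partial L}{\partial \dot{x}^A} \left( \frac{\partial^2 x^A}{\partial q_a \partial q_b}\, \dot{q}_b + \frac{\partial^2 x^A}{\partial q_a \partial t} \right) \tag{2.15}$$

So combining (2.13) with (2.15) we find

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_a}\right) - \frac{\partial L}{\partial q_a} = \left[ \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{x}^A}\right) - \frac{\partial L}{\partial x^A} \right] \frac{\partial x^A}{\partial q_a} \tag{2.16}$$
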
Equation (2.16) is our final result. We see that if Lagrange’s equation is solved in the
$x^A$ coordinate system (so that [...] on the RHS vanishes) then it is also solved in the
$q_a$ coordinate system. (Conversely, if it is satisfied in the $q_a$ coordinate system, so the
LHS vanishes, then it is also satisfied in the $x^A$ coordinate system as long as our choice
of coordinates is invertible: i.e. $\det(\partial x^A/\partial q_a) \neq 0$).


So the form of Lagrange’s equations holds in any coordinate system. This is in 
contrast to Newton’s equations which are only valid in an inertial frame. Let’s illustrate 
the power of this fact with a couple of simple examples.


2.2.1 Example: Rotating Coordinate Systems 


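Consider a free particle with Lagrangian given by

$$L = \tfrac{1}{2} m \dot{\mathbf{r}}^2 \tag{2.17}$$


with r = (x, y, z). Now measure the motion of the particle with respect to a coordinate
system which is rotating with angular velocity $\boldsymbol{\omega} = (0, 0, \omega)$ about the z axis. If
$\mathbf{r}' = (x', y', z')$ are the coordinates in the rotating system, we have the relationship


$$\begin{aligned} x' &= x\cos\omega t + y\sin\omega t \\ y' &= y\cos\omega t - x\sin\omega t \\ z' &= z \end{aligned} \tag{2.18}$$


Then we can substitute these expressions into the Lagrangian to find L in terms of the
rotating coordinates,


$$L = \tfrac{1}{2} m \left[ (\dot{x}' - \omega y')^2 + (\dot{y}' + \omega x')^2 + \dot{z}'^2 \right] = \tfrac{1}{2} m (\dot{\mathbf{r}}' + \boldsymbol{\omega} \times \mathbf{r}')^2 \tag{2.19}$$


In this rotating frame, we can use Lagrange’s equations to derive the equations of
motion. Taking derivatives, we have


$$\begin{aligned} \frac{\partial L}{\partial \mathbf{r}'} &= m\left( \dot{\mathbf{r}}' \times \boldsymbol{\omega} - \boldsymbol{\omega} \times (\boldsymbol{\omega} \times \mathbf{r}') \right) \\ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{\mathbf{r}}'} \right) &= m\left( \ddot{\mathbf{r}}' + \boldsymbol{\omega} \times \dot{\mathbf{r}}' \right) \end{aligned} \tag{2.20}$$

so Lagrange’s equation reads

$$\frac{d}{dt}\left( \frac{\partial L}{\partial \dot{\mathbf{r}}'} \right) - \frac{\partial L}{\partial \mathbf{r}'} = m\left( \ddot{\mathbf{r}}' + \boldsymbol{\omega} \times (\boldsymbol{\omega} \times \mathbf{r}') + 2\boldsymbol{\omega} \times \dot{\mathbf{r}}' \right) = 0 \tag{2.21}$$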


The second and third terms in this expression are the centrifugal and coriolis forces
respectively. These are examples of the “fictitious forces” that you were warned about in
the first year. They’re called fictitious because they’re a consequence of the reference
frame, rather than any interaction. But don’t underestimate their importance just
because they’re “fictitious”! According to Einstein’s theory of General Relativity, the
force of gravity is on the same footing as these fictitious forces.


Figure 4: In the northern hemisphere, a particle is deflected in a clockwise direction; in the
southern hemisphere in an anti-clockwise direction.
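
One way to gain confidence in the rotating-frame equation of motion is to integrate it numerically and compare with a straight line from the inertial frame mapped through the rotation. The sketch below does this for planar motion, where the equation reduces to $\ddot{x}' = \omega^2 x' + 2\omega\dot{y}'$ and $\ddot{y}' = \omega^2 y' - 2\omega\dot{x}'$. The function names, the Runge-Kutta integrator and the numerical values are illustrative assumptions.

```python
import math


def rotating_frame_trajectory(x0, y0, u, v, omega, t_end, steps):
    """Integrate the planar rotating-frame equations of motion with RK4.

    (x0, y0) and (u, v) are the inertial-frame position and velocity at t = 0,
    when the two sets of axes coincide. Returns (x', y') at t_end.
    """
    def rhs(s):
        x, y, vx, vy = s
        # centrifugal term omega^2 r' plus coriolis term -2 omega x rdot'
        return (vx, vy,
                omega ** 2 * x + 2 * omega * vy,
                omega ** 2 * y - 2 * omega * vx)

    def shifted(s, k, c):
        return tuple(si + c * ki for si, ki in zip(s, k))

    h = t_end / steps
    # velocity seen in the rotating frame at t = 0: rdot' = rdot - omega x r
    s = (x0, y0, u + omega * y0, v - omega * x0)
    for _ in range(steps):
        k1 = rhs(s)
        k2 = rhs(shifted(s, k1, h / 2))
        k3 = rhs(shifted(s, k2, h / 2))
        k4 = rhs(shifted(s, k3, h))
        s = tuple(si + h / 6 * (a + 2 * b + 2 * c + d)
                  for si, a, b, c, d in zip(s, k1, k2, k3, k4))
    return s[0], s[1]


def rotated_straight_line(x0, y0, u, v, omega, t):
    """Free motion in the inertial frame, mapped to rotating coordinates."""
    x, y = x0 + u * t, y0 + v * t
    c, s = math.cos(omega * t), math.sin(omega * t)
    return x * c + y * s, y * c - x * s


xp, yp = rotating_frame_trajectory(1.0, 0.5, 0.3, -0.2, 1.5, 2.0, 2000)
xe, ye = rotated_straight_line(1.0, 0.5, 0.3, -0.2, 1.5, 2.0)
```

The two results agree to within the integration error. In other words, the centrifugal and coriolis terms are exactly what is needed for a free particle's straight-line motion to look right from the rotating frame.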


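The centrifugal force $\mathbf{F}_{\text{cent}} = -m\boldsymbol{\omega} \times (\boldsymbol{\omega} \times \mathbf{r}')$ points outwards in the plane perpendicular
to $\boldsymbol{\omega}$ with magnitude $m\omega^2 |\mathbf{r}'_\perp| = m|\mathbf{v}_\perp|^2/|\mathbf{r}'_\perp|$ where $\perp$ denotes the projection
perpendicular to $\boldsymbol{\omega}$.


The coriolis force $\mathbf{F}_{\text{cor}} = -2m\boldsymbol{\omega} \times \dot{\mathbf{r}}'$ is responsible for the large scale circulation of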
oceans and the atmosphere. For a particle travelling on the surface of the rotating 
earth, the direction of the coriolis force is drawn in figure 4. We see that a particle 
thrown in the northern hemisphere will be seen to rotate in a clockwise direction; a 
particle thrown in the southern hemisphere rotates in an anti-clockwise direction. For 
a particle moving along the equator, the coriolis force points directly upwards, so has 
no effect on the particle. 


More details on the effect of the Coriolis force in various circumstances can be found 
in the Dynamics and Relativity lecture notes. Questions discussed include: 


• The coriolis force is responsible for the formation of hurricanes. These rotate
in different directions in the northern and southern hemisphere, and never form
within 500 miles of the equator where the coriolis force is irrelevant. But hurricanes
rotate anti-clockwise in the northern hemisphere. This is the opposite
direction from what we deduced above for a projected particle! What did we
miss?

• Estimate the magnitude of the coriolis force. Do you think that it really affects
the motion of water going down a plughole? What about the direction in which
a CD spins?


• Stand on top of a tower at the equator and drop a ball. As the ball falls, the
earth turns underneath from west to east. Does the ball land
1. At the base of the tower?
2. To the east?
3. To the west?


2.2.2 Example: Hyperbolic Coordinates 


A particle moves in the (x, y) plane with a force
directed towards the origin O with magnitude proportional
to the distance from O. How does it move? In
Cartesian coordinates, this problem is easy. We have
the Lagrangian


$$L = \tfrac{1}{2} m(\dot{x}^2 + \dot{y}^2) - \tfrac{1}{2} k(x^2 + y^2) \tag{2.22}$$


Let’s set m = k = 1 for simplicity. The equation of
motion for this system is simply


$$\ddot{x} = -x \quad \text{and} \quad \ddot{y} = -y \tag{2.23}$$


Figure 5: Hyperbolic coordinates.


Now suppose we want to know the motion of the system in hyperbolic coordinates 
defined as 


$$2xy = \mu, \qquad x^2 - y^2 = \lambda \tag{2.24}$$


The coordinates $\mu$ and $\lambda$ are curvilinear and orthogonal (i.e. two hyperbolae intersect
at 90°). We could try solving this problem by substituting the change of coordinates
directly into the equations of motion. It’s a mess. (Try if you don’t believe me!)
A much simpler way is to derive expressions for $x$, $y$, $\dot{x}$ and $\dot{y}$ in terms of the new
coordinates and substitute into the Lagrangian to find,

$$L = \frac{1}{8}\, \frac{\dot{\lambda}^2 + \dot{\mu}^2}{\sqrt{\lambda^2 + \mu^2}} - \frac{1}{2} \sqrt{\lambda^2 + \mu^2} \tag{2.25}$$

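From which we can easily derive the equation of motion for $\lambda$


$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{\lambda}}\right) - \frac{\partial L}{\partial \lambda} = \frac{d}{dt}\left[ \frac{\dot{\lambda}}{4\sqrt{\lambda^2 + \mu^2}} \right] + \frac{1}{8}\, (\dot{\lambda}^2 + \dot{\mu}^2)\, \frac{\lambda}{(\lambda^2 + \mu^2)^{3/2}} + \frac{1}{2}\, \frac{\lambda}{\sqrt{\lambda^2 + \mu^2}} = 0 \tag{2.26}$$


Which is also a mess! But it’s a mess that was much simpler to derive. Moreover, we
don’t need to do any more work to get the second equation for $\mu$: the symmetry of the
Lagrangian means that it must be the same as (2.26) with $\lambda \leftrightarrow \mu$ interchanged.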
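
A quick numerical spot check of (2.25) is easy to set up: pick any point and velocity in Cartesian coordinates, convert to hyperbolic coordinates using (2.24) and its time derivative, and compare the two expressions for L with m = k = 1. The function names and sample values below are assumptions for this sketch.

```python
import math


def lagrangian_cartesian(x, y, xdot, ydot):
    # Equation (2.22) with m = k = 1
    return 0.5 * (xdot ** 2 + ydot ** 2) - 0.5 * (x ** 2 + y ** 2)


def to_hyperbolic(x, y, xdot, ydot):
    # mu = 2xy and lambda = x^2 - y^2, together with their time derivatives
    mu = 2 * x * y
    lam = x * x - y * y
    mudot = 2 * (xdot * y + x * ydot)
    lamdot = 2 * (x * xdot - y * ydot)
    return lam, mu, lamdot, mudot


def lagrangian_hyperbolic(lam, mu, lamdot, mudot):
    # Equation (2.25)
    rho = math.sqrt(lam ** 2 + mu ** 2)
    return (lamdot ** 2 + mudot ** 2) / (8 * rho) - 0.5 * rho


sample = (0.7, -1.3, 0.4, 2.1)
L_cart = lagrangian_cartesian(*sample)
L_hyp = lagrangian_hyperbolic(*to_hyperbolic(*sample))
```

The two values agree to rounding error because $(x^2 - y^2)^2 + (2xy)^2 = (x^2 + y^2)^2$, so $\sqrt{\lambda^2 + \mu^2} = x^2 + y^2$, and $\dot{\lambda}^2 + \dot{\mu}^2 = 4(x^2 + y^2)(\dot{x}^2 + \dot{y}^2)$.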


2.3 Constraints and Generalised Coordinates 


Now we turn to the second advantage of the Lagrangian formulation. In writing $\dot{\mathbf{p}}_i =
-\nabla_i V$, we implicitly assume that each particle can happily roam anywhere in space
$\mathbb{R}^3$. What if there are constraints? In Newtonian mechanics, we introduce “constraint
forces”. These are things like the tension of ropes, and normal forces applied by surfaces. 
In the Lagrangian formulation, we don’t have to worry about such things. In this 
section, we’ll show why. 


An Example: The Pendulum 


The simple pendulum has a single dynamical degree of freedom
$\theta$, the angle the pendulum makes with the vertical. The position of
the mass m in the plane is described by two cartesian coordinates x
and y subject to a constraint $x^2 + y^2 = l^2$. We can parameterise this
as $x = l\sin\theta$ and $y = l\cos\theta$. Employing the Newtonian method
to solve this system, we introduce the tension T as shown in the
diagram and resolve the force vectors to find,


$$m\ddot{x} = -Tx/l, \qquad m\ddot{y} = mg - Ty/l \tag{2.27}$$


[Figure 6: the pendulum, with the tension T marked]

To determine the motion of the system, we impose the constraints
at the level of the equation of motion, and then easily find

$$\ddot{\theta} = -(g/l)\sin\theta, \qquad T = ml\dot{\theta}^2 + mg\cos\theta \tag{2.28}$$
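
To see the two results in (2.28) at work, the sketch below integrates $\ddot{\theta} = -(g/l)\sin\theta$ for a pendulum released from rest and then evaluates the tension from $T = ml\dot{\theta}^2 + mg\cos\theta$. For release from rest at $\theta_0$, combining (2.28) with energy conservation gives the cross-check $T = mg(3\cos\theta - 2\cos\theta_0)$. The function name, default parameters and integrator are assumptions made for illustration.

```python
import math


def pendulum(theta0, g=9.81, l=1.0, m=1.0, h=1e-3, steps=3000):
    """Integrate theta'' = -(g/l) sin(theta) from rest at theta0 with RK4.

    Returns the final angle, the final angular velocity and the tension
    T = m l thetadot^2 + m g cos(theta) at that instant.
    """
    def acc(theta):
        return -(g / l) * math.sin(theta)

    theta, thetadot = theta0, 0.0
    for _ in range(steps):
        k1t, k1w = thetadot, acc(theta)
        k2t, k2w = thetadot + 0.5 * h * k1w, acc(theta + 0.5 * h * k1t)
        k3t, k3w = thetadot + 0.5 * h * k2w, acc(theta + 0.5 * h * k2t)
        k4t, k4w = thetadot + h * k3w, acc(theta + h * k3t)
        theta += h / 6 * (k1t + 2 * k2t + 2 * k3t + k4t)
        thetadot += h / 6 * (k1w + 2 * k2w + 2 * k3w + k4w)
    tension = m * l * thetadot ** 2 + m * g * math.cos(theta)
    return theta, thetadot, tension


theta0 = 1.0
theta, thetadot, tension = pendulum(theta0)
# Cross-check from energy conservation (m = 1, g = 9.81, release from rest)
expected = 1.0 * 9.81 * (3 * math.cos(theta) - 2 * math.cos(theta0))
```

The tension is obtained only after the motion is known, which is the extra work that the Lagrangian treatment of constraints is designed to avoid.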


While this example was pretty straightforward to solve using Newtonian methods, 
things get rapidly harder when we consider more complicated constraints (and we’ll 
see plenty presently). Moreover, you may have noticed that half of the work of the 
calculation went into computing the tension T. On occasion we’ll be interested in this.
(For example, we might want to know how fast we can spin the pendulum before it
breaks). But often we won’t care about these constraint forces, but will only want to
know the motion of the pendulum itself. In this case it seems like a waste of effort to 
go through the motions of computing T. We’ll now see how we can avoid this extra 
work in the Lagrangian formulation. Firstly, let’s define what we mean by constraints 
more rigorously. 


2.3.1 Holonomic Constraints 


Holonomic Constraints are relationships between the coordinates of the form

$$f_\alpha(x^A, t) = 0 \qquad \alpha = 1, \ldots, 3N - n \tag{2.29}$$

In general the constraints can be time dependent and our notation above allows for
this. Holonomic constraints can be solved in terms of $n$ generalised coordinates $q_i$,
$i = 1, \ldots, n$. So

$$x^A = x^A(q_1, \ldots, q_n) \tag{2.30}$$

The system is said to have $n$ degrees of freedom. For the pendulum example above,
the system has a single degree of freedom, $q = \theta$.


Now let’s see how the Lagrangian formulation deals with constraints of this form. We
introduce $3N - n$ new dynamical degrees of freedom, $\lambda_\alpha(t)$. These are called Lagrange
multipliers. Note that each sits on the same footing as the original $x^A(t)$ in the sense
that they are functions of time. We now define a new Lagrangian

$$L' = L(x^A, \dot{x}^A) + \lambda_\alpha f_\alpha(x^A, t) \tag{2.31}$$

We treat $\lambda_\alpha$ like new coordinates. Since $L'$ doesn’t depend on $\dot{\lambda}_\alpha$, Lagrange’s
equations for $\lambda_\alpha$ are

$$\frac{\partial L'}{\partial \lambda_\alpha} = f_\alpha(x^A, t) = 0 \tag{2.32}$$

which gives us back the constraints. Meanwhile, the equations for $x^A$ are

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{x}^A}\right) - \frac{\partial L}{\partial x^A} = \lambda_\alpha \frac{\partial f_\alpha}{\partial x^A} \tag{2.33}$$

The LHS is the equation of motion for the unconstrained system. The RHS is the
manifestation of the constraint forces in the system. We can now solve these equations
as we did in the Newtonian formulation.
The Pendulum Example Again

The Lagrangian for the pendulum is given by that for a free particle moving in the
plane, augmented by the Lagrange multiplier term for the constraints. It is

$$L' = \tfrac{1}{2} m(\dot{x}^2 + \dot{y}^2) + mgy + \tfrac{1}{2}\lambda(x^2 + y^2 - l^2) \tag{2.34}$$

From which we can calculate the two equations of motion for $x$ and $y$,

$$m\ddot{x} = \lambda x \qquad \text{and} \qquad m\ddot{y} = mg + \lambda y \tag{2.35}$$

while the equation of motion for $\lambda$ reproduces the constraint $x^2 + y^2 - l^2 = 0$. Comparing
with the Newtonian approach (2.27), we again see that the Lagrange multiplier $\lambda$ is
proportional to the tension: $\lambda = -T/l$.


So we see that we can easily incorporate constraint forces into the Lagrangian setup 
using Lagrange multipliers. But the big news is that we don’t have to! Often we don’t 
care about the tension T or other constraint forces, but only want to know what the 
generalised coordinates $q_i$ are doing. In this case we have the following useful theorem

Theorem: For constrained systems, we may derive the equations of motion directly
in generalised coordinates $q_i$

$$L[q_i, \dot{q}_i, t] = L[x^A(q_i, t), \dot{x}^A(q_i, \dot{q}_i, t)] \tag{2.36}$$

Proof: Let’s work with $L' = L + \lambda_\alpha f_\alpha$ and change coordinates to

$$x^A \to \begin{cases} q_i & i = 1, \ldots, n \\ f_\alpha & \alpha = 1, \ldots, 3N - n \end{cases} \tag{2.37}$$

We know that Lagrange’s equations take the same form in these new coordinates. In
particular, we may look at the equations for $q_i$,

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right) - \frac{\partial L}{\partial q_i} = \lambda_\alpha \frac{\partial f_\alpha}{\partial q_i} \tag{2.38}$$

But, by definition, $\partial f_\alpha / \partial q_i = 0$. So we are left with Lagrange’s equations purely in
terms of $q_i$, with no sign of the constraint forces. If we are only interested in the
dynamics of the generalised coordinates $q_i$, we may ignore the Lagrange multipliers
and work entirely with the unconstrained Lagrangian $L(q_i, \dot{q}_i, t)$ defined in (2.36) where
we just substitute in $x^A = x^A(q_i, t)$.
The Pendulum Example for the Last Time

Let’s see how this works in the simple example of the pendulum. We can parameterise
the constraints in terms of the generalised coordinate $\theta$ so that $x = l\sin\theta$ and
$y = l\cos\theta$. We now substitute this directly into the Lagrangian for a particle moving in
the plane under the effect of gravity, to get

$$L = \tfrac{1}{2} m(\dot{x}^2 + \dot{y}^2) + mgy = \tfrac{1}{2} ml^2\dot{\theta}^2 + mgl\cos\theta \tag{2.39}$$

From which we may derive Lagrange’s equations using the coordinate $\theta$ directly

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{\theta}}\right) - \frac{\partial L}{\partial \theta} = ml^2\ddot{\theta} + mgl\sin\theta = 0 \tag{2.40}$$


which indeed reproduces the equation of motion for the pendulum (2.28). Note that, 
as promised, we haven’t calculated the tension T using this method. This has the 
advantage that we’ve needed to do less work. If we need to figure out the tension, we 
have to go back to the more laborious Lagrange multiplier method. 
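
As a quick numerical cross-check of the three pendulum calculations, the following sketch evaluates both sides of the multiplier equations (2.35) at an arbitrary state, using the equation of motion and tension from (2.28) and the identification $\lambda = -T/l$. The function name and the parameter values are illustrative assumptions, not part of the notes.

```python
import math

def pendulum_check(theta, theta_dot, m=1.0, l=2.0, g=9.81):
    """Residuals of m x'' = lambda x and m y'' = m g + lambda y, as in (2.35)."""
    theta_ddot = -(g / l) * math.sin(theta)                    # equation of motion (2.28)
    tension = m * l * theta_dot**2 + m * g * math.cos(theta)   # tension from (2.28)
    x, y = l * math.sin(theta), l * math.cos(theta)
    # Accelerations of x = l sin(theta) and y = l cos(theta) by the chain rule.
    x_ddot = l * (math.cos(theta) * theta_ddot - math.sin(theta) * theta_dot**2)
    y_ddot = -l * (math.sin(theta) * theta_ddot + math.cos(theta) * theta_dot**2)
    lam = -tension / l                                         # lambda = -T / l
    residual_x = m * x_ddot - lam * x
    residual_y = m * y_ddot - (m * g + lam * y)
    return residual_x, residual_y

print(pendulum_check(0.7, 1.3))
```

Both residuals should vanish up to rounding error for any choice of $\theta$ and $\dot{\theta}$, which is just the statement that the generalised-coordinate equation (2.40) and the multiplier equations describe the same motion.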


2.3.2 Non-Holonomic Constraints 


For completeness, let’s quickly review a couple of non-holonomic constraints. There’s 
no general theory to solve systems of this type, although it turns out that both of the 
examples we describe here can be solved with relative ease using different methods. We 
won’t discuss non-holonomic constraints for the rest of this course, and include a brief 
description here simply to inform you of the sort of stuff we won’t see! 


Inequalities 


Consider a particle moving under gravity on the outside of a sphere of radius $R$. It is
constrained to satisfy $x^2 + y^2 + z^2 \geq R^2$. This type of constraint, involving an inequality,
is non-holonomic. When the particle lies close to the top of the sphere, we know that 
it will remain in contact with the surface and we can treat the constraint effectively as 
holonomic. But at some point the particle will fall off. To determine when this happens 
requires different methods from those above (although it is not particularly difficult). 


Velocity Dependent Constraints 


Constraints of the form $g(x^A, \dot{x}^A, t) = 0$ which cannot be integrated to give $f(x^A, t) = 0$
are non-holonomic. For example, consider a coin of radius $R$ rolling down a slope as
shown in figure 7. The coordinates $(x, y)$ fix the coin’s position on the slope. But the
coin has other degrees of freedom as well: the angle $\theta$ it makes with the path of steepest
descent, and the angle $\phi$ that a marked point on the rim of the coin makes with the
vertical. If the coin rolls without slipping, then there are constraints on the evolution
of these coordinates. We must have that the velocity of the rim is $v_{\rm rim} = R\dot{\phi}$. So, in
terms of our four coordinates, we have the constraint

$$\dot{x} = R\dot{\phi}\sin\theta , \qquad \dot{y} = R\dot{\phi}\cos\theta \tag{2.41}$$

But these cannot be integrated to give constraints of the form $f(x, y, \theta, \phi) = 0$. They
are non-holonomic.

Figure 7: The coin rolling down a slope leads to velocity dependent, non-holonomic constraints.
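
One way to see why the rolling constraint cannot be integrated is to note that the final position of the coin depends on the route taken through $(\theta, \phi)$, not just on the final angles. The sketch below integrates the rolling conditions $\dot{x} = R\dot{\phi}\sin\theta$, $\dot{y} = R\dot{\phi}\cos\theta$ along two hypothetical piecewise paths that end at the same $\theta$ and $\phi$; the function `roll` and the chosen paths are assumptions made for illustration.

```python
import math

def roll(path, R=1.0, steps=1000):
    """Integrate the rolling constraint along a list of (d_theta, d_phi) segments."""
    x = y = theta = phi = 0.0
    for d_theta, d_phi in path:
        for _ in range(steps):
            # Use the midpoint value of theta over each small step.
            theta_mid = theta + 0.5 * d_theta / steps
            x += R * (d_phi / steps) * math.sin(theta_mid)
            y += R * (d_phi / steps) * math.cos(theta_mid)
            theta += d_theta / steps
            phi += d_phi / steps
    return x, y, theta, phi

# Same final angles (theta = pi/2, phi = 1), reached in two different orders.
turn_then_roll = roll([(math.pi / 2, 0.0), (0.0, 1.0)])
roll_then_turn = roll([(0.0, 1.0), (math.pi / 2, 0.0)])
print(turn_then_roll)
print(roll_then_turn)
```

The two runs finish with identical angles but different $(x, y)$, so no function $f(x, y, \theta, \phi) = 0$ can encode the constraint.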


2.3.3 Summary 


Let’s review what we’ve learnt so far. A system is described by $n$ generalised coordinates
$q_i$ which define a point in an $n$-dimensional configuration space $C$. Time evolution is a
curve in $C$ governed by the Lagrangian

$$L(q_i, \dot{q}_i, t) \tag{2.42}$$

such that the $q_i$ obey

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right) - \frac{\partial L}{\partial q_i} = 0 \tag{2.43}$$

These are $n$ coupled 2nd order (usually) non-linear differential equations. Before we
move on, let’s take this opportunity to give an important definition. The quantity

$$p_i = \frac{\partial L}{\partial \dot{q}_i} \tag{2.44}$$

is called the generalised momentum conjugate to $q_i$. (It only coincides with the real
momentum in Cartesian coordinates). We can now rewrite Lagrange’s equations (2.43)
as $\dot{p}_i = \partial L/\partial q_i$. The generalised momenta will play an important role in Section 4.


Note: The Lagrangian $L$ is not unique. We may make the transformation

$$L' = \alpha L \ \ \text{for } \alpha \in \mathbf{R} \qquad \text{or} \qquad L' = L + \frac{df}{dt} \tag{2.45}$$

for any function $f$ and the equations of motion remain unchanged. To see that the last
statement is true, we could either plug $L'$ into Lagrange’s equations or, alternatively,
recall that Lagrange’s equations can be derived from an action principle and the action
(which is the time integral of the Lagrangian) changes only by a constant under
the transformation. (As an aside: A system no longer remains invariant under these
transformations in quantum mechanics. The number $\alpha$ is related to Planck’s constant,
while transformations of the second type lead to rather subtle and interesting effects
related to the mathematics of topology).
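
The action-principle argument can be spelled out in a line or two. The sketch below assumes that $f$ depends only on the coordinates and time, $f = f(q_i, t)$, that the endpoints of the path are held fixed in the variation, and that $\alpha \neq 0$; the symbols $S$, $t_i$ and $t_f$ are introduced here for illustration.

```latex
% Adding a total derivative shifts the action by boundary terms only:
S'[q] = \int_{t_i}^{t_f} \left( L + \frac{df}{dt} \right) dt
      = S[q] + f\big(q(t_f), t_f\big) - f\big(q(t_i), t_i\big)
% With \delta q(t_i) = \delta q(t_f) = 0 the boundary terms do not vary, so
\delta S' = \delta S
% Rescaling the Lagrangian rescales the variation:
L' = \alpha L \quad \Longrightarrow \quad \delta S' = \alpha \, \delta S
```

In both cases $\delta S' = 0$ exactly when $\delta S = 0$, so the two Lagrangians share the same stationary paths.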


2.3.4 Joseph-Louis Lagrange (1736-1813) 


Lagrange started off life studying law but changed his mind and turned to mathematics 
after reading a book on optics by Halley (of comet fame). Despite being mostly self- 
taught, by the age of 19 he was a professor in his home town of Turin. 


He stayed in Italy, somewhat secluded, for the next 11 years although he commu- 
nicated often with Euler and, in 1766, moved to Berlin to take up Euler’s recently 
vacated position. It was there he did his famous work on mechanics and the calculus of 
variations that we’ve seen above. In 1787 he moved once again, now to Paris. He was 
just in time for the French revolution and only survived a law ordering the arrest of 
all foreigners after the intervention of the chemist Lavoisier who was a rather powerful 
political figure. (One year later, Lavoisier lost his power, followed quickly by his head.) 


Lagrange published his collected works on mechanics in 1788 in a book called “Mechanique 
Analytique”. He considered the work to be pure mathematics and boasts in the intro- 
duction that it contains no figures, thereby putting the anal in analytique. 


Since I started with a quote about Newton’s teaching, I’ll include here a comment
on Lagrange’s lectures by one of his more famous students:


“His voice is very feeble, at least in that he does not become heated; he 
has a very pronounced Italian accent and pronounces the s like z ... The 
students, of whom the majority are incapable of appreciating him, give him 
little welcome, but the professors make amends for it.” 


Fourier analysis of Lagrange 




2.4 Noether’s Theorem and Symmetries 


In this subsection we shall discuss the appearance of conservation laws in the Lagrangian 
formulation and, in particular, a beautiful and important theorem due to Noether 
relating conserved quantities to symmetries. 


Let’s start with a definition. A function $F(q_i, \dot{q}_i, t)$ of the coordinates, their time
derivatives and (possibly) time $t$ is called a constant of motion (or a conserved quantity)
if the total time derivative vanishes

$$\frac{dF}{dt} = \sum_{j=1}^{n} \left( \frac{\partial F}{\partial q_j}\dot{q}_j + \frac{\partial F}{\partial \dot{q}_j}\ddot{q}_j \right) + \frac{\partial F}{\partial t} = 0 \tag{2.46}$$

whenever $q_i(t)$ satisfy Lagrange’s equations. This means that $F$ remains constant along
the path followed by the system. Here’s a couple of examples:


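Claim: If $L$ does not depend explicitly on time $t$ (i.e. $\partial L/\partial t = 0$) then

$$H = \sum_j \dot{q}_j \frac{\partial L}{\partial \dot{q}_j} - L \tag{2.47}$$

is constant. When $H$ is written as a function of $q_i$ and $p_i$, it is known as the Hamiltonian.
It is usually identified with the total energy of the system.

Proof

$$\frac{dH}{dt} = \sum_j \left( \ddot{q}_j \frac{\partial L}{\partial \dot{q}_j} + \dot{q}_j \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_j}\right) - \frac{\partial L}{\partial q_j}\dot{q}_j - \frac{\partial L}{\partial \dot{q}_j}\ddot{q}_j \right) \tag{2.48}$$

which vanishes whenever Lagrange’s equations (2.43) hold.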


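Claim: Suppose $\partial L/\partial q_j = 0$ for some $q_j$. Then $q_j$ is said to be ignorable (or cyclic).
We have the conserved quantity

$$p_j = \frac{\partial L}{\partial \dot{q}_j} \tag{2.49}$$

Proof:

$$\frac{dp_j}{dt} = \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_j}\right) = \frac{\partial L}{\partial q_j} = 0 \tag{2.50}$$

where we have used Lagrange’s equations (2.43) in the second equality.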
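2.4.1 Noether’s Theorem

Consider a one-parameter family of maps

$$q_i(t) \to Q_i(s, t) \qquad s \in \mathbf{R} \tag{2.51}$$

such that $Q_i(0, t) = q_i(t)$. Then this transformation is said to be a continuous symmetry
of the Lagrangian $L$ if

$$\frac{\partial}{\partial s} L(Q_i(s, t), \dot{Q}_i(s, t), t) = 0 \tag{2.52}$$

Noether’s theorem states that for each such symmetry there exists a conserved quantity.

Proof of Noether’s Theorem:

$$\frac{\partial L}{\partial s} = \frac{\partial L}{\partial Q_i}\frac{\partial Q_i}{\partial s} + \frac{\partial L}{\partial \dot{Q}_i}\frac{\partial \dot{Q}_i}{\partial s} \tag{2.53}$$

so we have

$$\begin{aligned}
0 = \left.\frac{\partial L}{\partial s}\right|_{s=0}
  &= \frac{\partial L}{\partial q_i}\left.\frac{\partial Q_i}{\partial s}\right|_{s=0} + \frac{\partial L}{\partial \dot{q}_i}\left.\frac{\partial \dot{Q}_i}{\partial s}\right|_{s=0} \\
  &= \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right)\left.\frac{\partial Q_i}{\partial s}\right|_{s=0} + \frac{\partial L}{\partial \dot{q}_i}\left.\frac{\partial \dot{Q}_i}{\partial s}\right|_{s=0} \qquad \text{(By Lagrange)} \\
  &= \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\left.\frac{\partial Q_i}{\partial s}\right|_{s=0}\right)
\end{aligned} \tag{2.54}$$

and the quantity $\sum_i (\partial L/\partial \dot{q}_i)(\partial Q_i/\partial s)$, evaluated at $s = 0$, is constant for all time.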


Example: Homogeneity of Space 


Consider the closed system of $N$ particles discussed in Section 1 with Lagrangian

$$L = \tfrac{1}{2}\sum_i m_i \dot{\mathbf{r}}_i^2 - V(|\mathbf{r}_i - \mathbf{r}_j|) \tag{2.55}$$

This Lagrangian has the symmetry of translation: $\mathbf{r}_i \to \mathbf{r}_i + s\mathbf{n}$ for any vector $\mathbf{n}$ and
for any real number $s$. This means that

$$L(\mathbf{r}_i, \dot{\mathbf{r}}_i, t) = L(\mathbf{r}_i + s\mathbf{n}, \dot{\mathbf{r}}_i, t) \tag{2.56}$$

This is the statement that space is homogeneous and a translation of the system by
$s\mathbf{n}$ does nothing to the equations of motion. These translations are elements of the
Galilean group that we met in section 1.2. From Noether’s theorem, we can compute
the conserved quantity associated with translations. It is

$$\sum_i \frac{\partial L}{\partial \dot{\mathbf{r}}_i} \cdot \mathbf{n} = \sum_i \mathbf{p}_i \cdot \mathbf{n} \tag{2.57}$$

which we recognise as the total linear momentum in the direction $\mathbf{n}$. Since this
holds for all $\mathbf{n}$, we conclude that $\sum_i \mathbf{p}_i$ is conserved. But this is very familiar. It is
simply the conservation of total linear momentum. To summarise

Homogeneity of Space $\Rightarrow$ Translation Invariance of $L$
$\Rightarrow$ Conservation of Total Linear Momentum

This statement should be intuitively clear. One point in space is much the same as any
other. So why would a system of particles speed up to get over there, when here is just
as good? This manifests itself as conservation of linear momentum.
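
To see translation invariance at work numerically, here is a minimal sketch with two particles on a line interacting through a potential that depends only on their separation. The quadratic potential $V = \tfrac{1}{2}k(x_1 - x_2)^2$, the velocity Verlet stepping and all parameter values are assumptions chosen for illustration; they are not taken from the notes.

```python
def simulate(steps=2000, dt=0.001, m1=1.0, m2=3.0, k=5.0):
    """Two particles with V = k (x1 - x2)^2 / 2; returns total momentum after each step."""
    x1, x2, v1, v2 = 0.0, 1.0, 0.4, -0.1
    force = lambda a, b: -k * (a - b)      # force on particle 1; particle 2 feels the opposite
    f = force(x1, x2)
    momenta = []
    for _ in range(steps):
        # Velocity Verlet: half kick, drift, recompute force, half kick.
        v1 += 0.5 * dt * f / m1
        v2 -= 0.5 * dt * f / m2
        x1 += dt * v1
        x2 += dt * v2
        f = force(x1, x2)
        v1 += 0.5 * dt * f / m1
        v2 -= 0.5 * dt * f / m2
        momenta.append(m1 * v1 + m2 * v2)
    return momenta

momenta = simulate()
drift = max(momenta) - min(momenta)
print(drift)
```

Because the potential depends only on $x_1 - x_2$, the forces on the two particles are equal and opposite and the total momentum stays at its initial value up to rounding error, even though each individual momentum oscillates.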


Example: Isotropy of Space 


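The isotropy of space is the statement that a closed system, described by the Lagrangian
(2.55) is invariant under rotations around an axis $\hat{\mathbf{n}}$, so all $\mathbf{r}_i \to \mathbf{r}_i'$ are rotated by the
same amount. To work out the corresponding conserved quantities it will suffice to
work with the infinitesimal form of the rotations

$$\mathbf{r}_i \to \mathbf{r}_i + \delta\mathbf{r}_i = \mathbf{r}_i + \alpha\,\hat{\mathbf{n}} \times \mathbf{r}_i \tag{2.58}$$

where $\alpha$ is considered infinitesimal. To see that this is indeed a rotation, you could
calculate the length of the vector and notice that it’s preserved to linear order in $\alpha$.
Then we have

$$L(\mathbf{r}_i, \dot{\mathbf{r}}_i) = L(\mathbf{r}_i + \alpha\,\hat{\mathbf{n}} \times \mathbf{r}_i, \dot{\mathbf{r}}_i + \alpha\,\hat{\mathbf{n}} \times \dot{\mathbf{r}}_i) \tag{2.59}$$

which gives rise to the conserved quantity

$$\sum_i \frac{\partial L}{\partial \dot{\mathbf{r}}_i} \cdot (\hat{\mathbf{n}} \times \mathbf{r}_i) = \sum_i \hat{\mathbf{n}} \cdot (\mathbf{r}_i \times \mathbf{p}_i) = \hat{\mathbf{n}} \cdot \mathbf{L} \tag{2.60}$$

Figure 8: (diagram of the rotation about the axis $\hat{\mathbf{n}}$)

This is the component of the total angular momentum in the direction $\hat{\mathbf{n}}$. Since the
vector $\hat{\mathbf{n}}$ is arbitrary, we get the result

Isotropy of Space $\Rightarrow$ Rotational Invariance of $L$
$\Rightarrow$ Conservation of Total Angular Momentum

Example: Homogeneity of Time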


What about homogeneity of time? In mathematical language, this means $L$ is invariant
under $t \to t + s$ or, in other words, $\partial L/\partial t = 0$. But we already saw earlier in this section
that this implies $H = \sum_i \dot{q}_i(\partial L/\partial \dot{q}_i) - L$ is conserved. In the systems we’re considering,
this is simply the total energy. We see that the existence of a conserved quantity which 
we call energy can be traced to the homogeneous passage of time. Or 


Time is to Energy as Space is to Momentum 


Recall from the lectures on Special Relativity that energy and 3-momentum fit together 
to form a 4-vector which rotates under spacetime transformations. Here we see that 
the link between energy-momentum and time-space exists even in the non-relativistic 
framework of Newtonian physics. You don’t have to be Einstein to see it. You just 
have to be Emmy Noether. 


Remarks: It turns out that all conservation laws in Nature (as described by the 
Standard Model and General Relativity) are related to symmetries through Noether’s 
theorem. This includes the conservation of electric charge and the conservation of 
particles such as protons and neutrons (known as baryons). 


There are also discrete symmetries in Nature which don’t depend on a continuous 
parameter. For example, many theories are invariant under reflection (known as parity) 
in which $\mathbf{r}_i \to -\mathbf{r}_i$. These types of symmetries do not give rise to conservation laws in
classical physics (although they do in quantum physics). 


2.5 Applications 


Having developed all of these tools, let’s now apply them to a few examples. 


2.5.1 Bead on a Rotating Hoop 

This is an example of a system with a time dependent holonomic constraint. The hoop
is of radius $a$ and rotates with frequency $\omega$ as shown in figure 9. The bead, of mass $m$,
is threaded on the hoop and moves without friction. We want to determine its motion.
There is a single degree of freedom $\psi$, the angle the bead makes with the vertical. In
terms of Cartesian coordinates $(x, y, z)$ the position of the bead is

$$x = a\sin\psi\cos\omega t , \qquad y = a\sin\psi\sin\omega t , \qquad z = a - a\cos\psi \tag{2.61}$$

To determine the Lagrangian in terms of the generalised coordinate $\psi$ we must substitute
these expressions into the Lagrangian for the free particle. For the kinetic energy
$T$ we have

$$T = \tfrac{1}{2} m(\dot{x}^2 + \dot{y}^2 + \dot{z}^2) = \tfrac{1}{2} ma^2[\dot{\psi}^2 + \omega^2\sin^2\psi] \tag{2.62}$$

while the potential energy $V$ is given by (ignoring an overall constant)

$$V = mgz = -mga\cos\psi \tag{2.63}$$

So, replacing $x$, $y$ and $z$ by $\psi$, we have the Lagrangian

$$L = ma^2\left(\tfrac{1}{2}\dot{\psi}^2 - V_{\rm eff}\right) \tag{2.64}$$

where the effective potential is

$$V_{\rm eff} = \frac{1}{ma^2}\left(-mga\cos\psi - \tfrac{1}{2} ma^2\omega^2\sin^2\psi\right) \tag{2.65}$$

Figure 9: (diagram of the bead on the rotating hoop)

We can now derive the equations of motion for the bead simply from Lagrange’s equations
which read

$$\ddot{\psi} = -\frac{\partial V_{\rm eff}}{\partial \psi} \tag{2.66}$$

Let’s look for stationary solutions of these equations in which the bead doesn’t move
(i.e. solutions of the form $\ddot{\psi} = \dot{\psi} = 0$). From the equation of motion, we must solve
$\partial V_{\rm eff}/\partial \psi = 0$ to find that the bead can remain stationary at points satisfying

$$g\sin\psi = a\omega^2\sin\psi\cos\psi \tag{2.67}$$


Figure 10: The effective potential for the bead depends on how fast the hoop is rotating.
(Three panels of $V_{\rm eff}$ against $\psi$, for $\omega = 0$, $0 < \omega^2 < g/a$ and $\omega^2 > g/a$, with the
stable and unstable stationary points marked.)


There are at most three such points: $\psi = 0$, $\psi = \pi$ or $\cos\psi = g/a\omega^2$. Note that the
first two solutions always exist, while the third stationary point is only there if the hoop
is spinning fast enough so that $\omega^2 > g/a$. Which of these stationary points is stable
depends on whether $V_{\rm eff}(\psi)$ has a local minimum (stable) or maximum (unstable). This
in turn depends on the value of $\omega$. $V_{\rm eff}$ is drawn for several values of $\omega$ in figure 10.
For $\omega^2 < g/a$, the point $\psi = 0$ at the bottom of the hoop is stable, while for $\omega^2 > g/a$,
the position at the bottom becomes unstable and the new solution at $\cos\psi = g/a\omega^2$ is
the stable point. For all values of $\omega$ the bead perched at the top of the hoop $\psi = \pi$ is
unstable.
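
The stability analysis for the bead can be checked with a few lines of code. The sketch below lists the stationary points of $V_{\rm eff} = -(g/a)\cos\psi - \tfrac{1}{2}\omega^2\sin^2\psi$ and classifies each by the sign of the second derivative, $V_{\rm eff}'' = (g/a)\cos\psi - \omega^2\cos 2\psi$. The function name and the numerical values of $g$, $a$ and $\omega$ are illustrative assumptions.

```python
import math

def stationary_points(omega, g=9.81, a=1.0):
    """Return [(psi, stable)] for V_eff = -(g/a) cos(psi) - (omega^2 / 2) sin^2(psi)."""
    def curvature(psi):
        # Second derivative of V_eff with respect to psi.
        return (g / a) * math.cos(psi) - omega**2 * math.cos(2 * psi)

    points = [0.0, math.pi]
    if omega**2 > g / a:
        # The third stationary point only exists for a fast enough hoop.
        points.append(math.acos(g / (a * omega**2)))
    return [(psi, curvature(psi) > 0) for psi in points]

print(stationary_points(1.0))   # slow hoop: omega^2 < g/a
print(stationary_points(5.0))   # fast hoop: omega^2 > g/a
```

For the slow hoop only the bottom is stable; for the fast hoop the bottom turns unstable and the new point at $\cos\psi = g/a\omega^2$ takes over, in line with the discussion above.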


2.5.2 Double Pendulum 


A double pendulum is drawn in figure 11, consisting of two particles of mass $m_1$ and $m_2$,
connected by light rods of length $l_1$ and $l_2$. For the first particle, the kinetic energy $T_1$
and the potential energy $V_1$ are the same as for a simple pendulum

$$T_1 = \tfrac{1}{2} m_1 l_1^2\dot{\theta}_1^2 \qquad \text{and} \qquad V_1 = -m_1 g l_1\cos\theta_1 \tag{2.68}$$

For the second particle it’s a little more involved. Consider the position of the second
particle in the $(x, y)$ plane in which the pendulum swings (where we take the origin to be
the pivot of the first pendulum with $y$ increasing downwards)

$$x_2 = l_1\sin\theta_1 + l_2\sin\theta_2 \qquad \text{and} \qquad y_2 = l_1\cos\theta_1 + l_2\cos\theta_2 \tag{2.69}$$

Figure 11: (diagram of the double pendulum)

Which we can substitute into the kinetic energy for the second particle

$$T_2 = \tfrac{1}{2} m_2(\dot{x}_2^2 + \dot{y}_2^2) = \tfrac{1}{2} m_2\left(l_1^2\dot{\theta}_1^2 + l_2^2\dot{\theta}_2^2 + 2 l_1 l_2\cos(\theta_1 - \theta_2)\dot{\theta}_1\dot{\theta}_2\right) \tag{2.70}$$

while the potential energy is given by

$$V_2 = -m_2 g y_2 = -m_2 g\left(l_1\cos\theta_1 + l_2\cos\theta_2\right) \tag{2.71}$$

The Lagrangian is given by the sum of the kinetic energies, minus the sum of the
potential energies

$$L = \tfrac{1}{2}(m_1 + m_2) l_1^2\dot{\theta}_1^2 + \tfrac{1}{2} m_2 l_2^2\dot{\theta}_2^2 + m_2 l_1 l_2\cos(\theta_1 - \theta_2)\dot{\theta}_1\dot{\theta}_2 + (m_1 + m_2) g l_1\cos\theta_1 + m_2 g l_2\cos\theta_2 \tag{2.72}$$

The equations of motion follow by simple calculus using Lagrange’s two equations (one
for $\theta_1$ and one for $\theta_2$). The solutions to these equations are complicated. In fact, above
a certain energy, the motion is chaotic.
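
The only step in the double pendulum that takes a little algebra is the kinetic energy of the second particle. As a sanity check, the sketch below compares $\tfrac{1}{2}m_2(\dot{x}_2^2 + \dot{y}_2^2)$, with the velocities obtained by differentiating the positions of the second bob, against the closed form with the $\cos(\theta_1 - \theta_2)$ cross term. The function names and default parameters are illustrative assumptions.

```python
import math

def t2_cartesian(th1, th2, w1, w2, m2=1.5, l1=1.0, l2=0.7):
    """Kinetic energy of the second bob from its Cartesian velocities."""
    x_dot = l1 * math.cos(th1) * w1 + l2 * math.cos(th2) * w2
    y_dot = -l1 * math.sin(th1) * w1 - l2 * math.sin(th2) * w2
    return 0.5 * m2 * (x_dot**2 + y_dot**2)

def t2_angles(th1, th2, w1, w2, m2=1.5, l1=1.0, l2=0.7):
    """Kinetic energy of the second bob written in terms of the angles."""
    return 0.5 * m2 * (l1**2 * w1**2 + l2**2 * w2**2
                       + 2 * l1 * l2 * math.cos(th1 - th2) * w1 * w2)

state = (0.4, -1.1, 0.9, 2.3)   # (theta1, theta2, theta1_dot, theta2_dot)
print(t2_cartesian(*state), t2_angles(*state))
```

The two numbers agree because the cross terms combine through $\cos\theta_1\cos\theta_2 + \sin\theta_1\sin\theta_2 = \cos(\theta_1 - \theta_2)$.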
2.5.3 Spherical Pendulum

The spherical pendulum is allowed to rotate in three dimensions. The system has two
degrees of freedom drawn in figure 12 which cover the range

$$0 \leq \theta < \pi \qquad \text{and} \qquad 0 \leq \phi < 2\pi \tag{2.73}$$

In terms of Cartesian coordinates, we have

$$x = l\cos\phi\sin\theta , \qquad y = l\sin\phi\sin\theta , \qquad z = -l\cos\theta$$

Figure 12: (diagram of the spherical pendulum, showing the angles $\theta$ and $\phi$)

We substitute these constraints into the Lagrangian for a free particle to get

$$L = \tfrac{1}{2} m(\dot{x}^2 + \dot{y}^2 + \dot{z}^2) - mgz = \tfrac{1}{2} ml^2(\dot{\theta}^2 + \dot{\phi}^2\sin^2\theta) + mgl\cos\theta \tag{2.74}$$

Notice that the coordinate $\phi$ is ignorable. From Noether’s theorem, we know that the
quantity

$$J = \frac{\partial L}{\partial \dot{\phi}} = ml^2\dot{\phi}\sin^2\theta \tag{2.75}$$

is constant. This is the component of angular momentum in the $\phi$ direction. The
equation of motion for $\theta$ follows from Lagrange’s equations and is

$$ml^2\ddot{\theta} = ml^2\dot{\phi}^2\sin\theta\cos\theta - mgl\sin\theta \tag{2.76}$$

We can substitute $\dot{\phi}$ for the constant $J$ in this expression to get an equation entirely in
terms of $\theta$ which we chose to write as

$$\ddot{\theta} = -\frac{\partial V_{\rm eff}}{\partial \theta} \tag{2.77}$$

where the effective potential is defined to be

$$V_{\rm eff}(\theta) = -\frac{g}{l}\cos\theta + \frac{J^2}{2m^2l^4}\frac{1}{\sin^2\theta} \tag{2.78}$$

An important point here: we must substitute for $J$ into the equations of motion. If
you substitute $J$ for $\dot{\phi}$ directly into the Lagrangian, you will derive an equation that
looks like the one above, but you’ll get a minus sign wrong! This is because Lagrange’s
equations are derived under the assumption that $\theta$ and $\phi$ are independent.
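
It may help to see the sign error explicitly. The following is a worked sketch, using $\dot{\phi} = J/(ml^2\sin^2\theta)$ from the definition of $J$; the label $L_{\rm wrong}$ is introduced here purely for illustration.

```latex
% Correct: substitute \dot\phi = J/(m l^2 \sin^2\theta) into the equation of motion
\ddot{\theta} = \frac{J^2}{m^2 l^4}\frac{\cos\theta}{\sin^3\theta} - \frac{g}{l}\sin\theta

% Incorrect: substitute into the Lagrangian first, treating J as a fixed number
L_{\rm wrong} = \tfrac{1}{2} m l^2 \dot{\theta}^2 + \frac{J^2}{2 m l^2 \sin^2\theta} + m g l \cos\theta

% and then apply Lagrange's equation for \theta, which gives
\ddot{\theta} = -\frac{J^2}{m^2 l^4}\frac{\cos\theta}{\sin^3\theta} - \frac{g}{l}\sin\theta
```

The two results differ only in the sign of the $J^2$ term, which is the wrong minus sign referred to above.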
Figure 13: The effective potential for the spherical pendulum.

As well as the conservation of angular momentum $J$, we also have $\partial L/\partial t = 0$ so
energy is conserved. This is given by

$$E = \tfrac{1}{2}\dot{\theta}^2 + V_{\rm eff}(\theta) \tag{2.79}$$

where $E$ is a constant. In fact we can invert this equation for $E$ to solve for $\theta$ in terms
of an integral

$$t - t_0 = \frac{1}{\sqrt{2}}\int \frac{d\theta}{\sqrt{E - V_{\rm eff}(\theta)}} \tag{2.80}$$

If we succeed in writing the solution to a problem in terms of an integral like this then
we say we’ve “reduced the problem to quadrature”. It’s kind of a cute way of saying
we can’t do the integral. But at least we have an expression for the solution that we
can play with or, if all else fails, we can simply plot on a computer.

Once we have an expression for $\theta(t)$ we can solve for $\phi(t)$ using the expression for
$J$,

$$\phi = \int \frac{J}{ml^2\sin^2\theta}\,dt = \frac{J}{\sqrt{2}\,ml^2}\int \frac{1}{\sqrt{E - V_{\rm eff}(\theta)}}\,\frac{1}{\sin^2\theta}\,d\theta$$

which gives us $\phi = \phi(\theta) = \phi(t)$. Let’s get more of a handle on what these solutions
look like. We plot the function $V_{\rm eff}$ in figure 13. For a given energy $E$, the particle is
restricted to the region $V_{\rm eff} \leq E$ (which follows from (2.79)). So from the figure we
see that the motion is pinned between two points $\theta_1$ and $\theta_2$. If we draw the motion of
the pendulum in real space, it must therefore look something like figure 14, in which
the bob oscillates between the two extremes: $\theta_1 \leq \theta \leq \theta_2$. Note that we could make
more progress in understanding the motion of the spherical pendulum than for the
double pendulum. The reason for this is the existence of two conservation laws for the
spherical pendulum (energy and angular momentum) compared to just one (energy)
for the double pendulum.


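There is a stable orbit which lies between the two extremal points at $\theta = \theta_0$,
corresponding to the minimum of $V_{\rm eff}$. This occurs if we balance the angular momentum $J$
and the energy $E$ just right. We can look at small oscillations around this point by
expanding $\theta = \theta_0 + \delta\theta$. Substituting into the equation of motion (2.77), we have

$$\delta\ddot{\theta} = -\left.\left(\frac{\partial^2 V_{\rm eff}}{\partial \theta^2}\right)\right|_{\theta = \theta_0}\delta\theta + O(\delta\theta^2) \tag{2.81}$$

so small oscillations about $\theta = \theta_0$ have frequency $\omega^2 = (\partial^2 V_{\rm eff}/\partial\theta^2)$ evaluated at $\theta = \theta_0$.

Figure 14: (the motion of the bob, confined between $\theta_1$ and $\theta_2$)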
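
The small-oscillation frequency can be estimated numerically straight from the effective potential. The sketch below picks an equilibrium angle $\theta_0$, fixes $J$ so that $\partial V_{\rm eff}/\partial\theta$ vanishes there, and takes a finite-difference second derivative of $V_{\rm eff}(\theta) = -(g/l)\cos\theta + J^2/(2m^2l^4\sin^2\theta)$. For comparison it also evaluates the closed form $(g/l)(1 + 3\cos^2\theta_0)/\cos\theta_0$, which is not stated in the notes but follows from eliminating $J$ using the equilibrium condition. Function names, step size and parameter values are assumptions.

```python
import math

def v_eff(theta, J, m=1.0, l=1.0, g=9.81):
    """Effective potential of the spherical pendulum."""
    return -(g / l) * math.cos(theta) + J**2 / (2 * m**2 * l**4 * math.sin(theta)**2)

def small_oscillation_omega2(theta0, m=1.0, l=1.0, g=9.81, h=1e-4):
    """Return (finite-difference curvature, closed form) at the minimum theta0 < pi/2."""
    # Choose J so that dV_eff/dtheta = 0 at theta0.
    J = m * l**2 * math.sqrt((g / l) * math.sin(theta0)**4 / math.cos(theta0))
    curvature = (v_eff(theta0 + h, J, m, l, g) - 2 * v_eff(theta0, J, m, l, g)
                 + v_eff(theta0 - h, J, m, l, g)) / h**2
    closed_form = (g / l) * (1 + 3 * math.cos(theta0)**2) / math.cos(theta0)
    return curvature, closed_form

print(small_oscillation_omega2(0.6))
```

The two values should agree to within the accuracy of the finite difference, which gives some confidence that the curvature of $V_{\rm eff}$ really does set the oscillation frequency about the steady orbit.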


2.5.4 Two Body Problem 


We now turn to the study of two objects interacting through a central force. The most 
famous example of this type is the gravitational interaction between two bodies in 
the solar system which leads to the elliptic orbits of planets and the hyperbolic orbits 
of comets. Let’s see how to frame this famous physics problem in the Lagrangian 
setting. We start by rewriting the Lagrangian in terms of the centre of mass $\mathbf{R}$ and
the separation $\mathbf{r}_{12} = \mathbf{r}_1 - \mathbf{r}_2$ and work with an arbitrary potential $V(|\mathbf{r}_{12}|)$

$$L = \tfrac{1}{2} m_1\dot{\mathbf{r}}_1^2 + \tfrac{1}{2} m_2\dot{\mathbf{r}}_2^2 - V(|\mathbf{r}_{12}|) = \tfrac{1}{2}(m_1 + m_2)\dot{\mathbf{R}}^2 + \tfrac{1}{2}\mu\dot{\mathbf{r}}_{12}^2 - V(|\mathbf{r}_{12}|) \tag{2.82}$$

where $\mu = m_1 m_2/(m_1 + m_2)$ is the reduced mass. The Lagrangian splits into a piece
describing the centre of mass $\mathbf{R}$ and a piece describing the separation. This is familiar
from Section 1.3.2. From now on we neglect the centre of mass piece and focus on the
separation. We know from Noether’s theorem that $\mathbf{L} = \mathbf{r}_{12} \times \mathbf{p}_{12}$ is conserved, where
$\mathbf{p}_{12}$ is the momentum conjugate to $\mathbf{r}_{12}$. Since $\mathbf{L}$ is perpendicular to $\mathbf{r}_{12}$, the motion of
the orbit must lie in a plane perpendicular to $\mathbf{L}$. Using polar coordinates $(r, \phi)$ in that
plane, the Lagrangian is

$$L = \tfrac{1}{2}\mu(\dot{r}^2 + r^2\dot{\phi}^2) - V(r) \tag{2.83}$$

To make further progress, notice that $\phi$ is ignorable so, once again using Noether’s
theorem, we have the conserved quantity

$$J = \mu r^2\dot{\phi} \tag{2.84}$$
This is also conservation of angular momentum: to reduce to the Lagrangian (2.83), we
used the fact that the direction of $\mathbf{L}$ is fixed; the quantity $J$ is related to the magnitude
of $\mathbf{L}$. To figure out the motion we calculate Lagrange’s equation for $r$ from (2.83)

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{r}}\right) - \frac{\partial L}{\partial r} = \mu\ddot{r} - \mu r\dot{\phi}^2 + \frac{\partial V}{\partial r} = 0 \tag{2.85}$$

We can eliminate $\dot{\phi}$ from this equation by writing it in terms of the constant $J$ to get
a differential equation for the orbit purely in terms of $r$,

$$\mu\ddot{r} = -\frac{\partial}{\partial r}V_{\rm eff}(r) \tag{2.86}$$

where the effective potential is given by

$$V_{\rm eff}(r) = V(r) + \frac{J^2}{2\mu r^2} \tag{2.87}$$

The last term is known as the “angular momentum barrier”. Let me reiterate the
warning of the spherical pendulum: do not substitute $J = \mu r^2\dot{\phi}$ directly into the
Lagrangian, or you will get a minus sign wrong! You must substitute it into the equations
of motion.


Figure 15: The effective potential for two bodies interacting gravitationally. (The plot of
$V_{\rm eff}$ against $r$ marks the energy levels of a hyperbolic orbit, an elliptic orbit and a
circular orbit.)


So far, you may recognise that the analysis has been rather similar to that of the
spherical pendulum. Let’s continue following that path. Since $\partial L/\partial t = 0$, Noether
tells us that energy is conserved and

$$E = \tfrac{1}{2}\mu\dot{r}^2 + V_{\rm eff}(r) \tag{2.88}$$

is constant throughout the motion. We can use this fact to “reduce to quadrature”,

$$t - t_0 = \sqrt{\frac{\mu}{2}}\int \frac{dr}{\sqrt{E - V_{\rm eff}(r)}} \tag{2.89}$$

Up to this point the analysis is for an arbitrary potential $V(r)$. At this point let’s
specialise to the case of two bodies interacting gravitationally with

$$V(r) = -\frac{Gm_1m_2}{r} \tag{2.90}$$

where $G$ is Newton’s constant. For this potential, the different solutions were studied
in your Part I mechanics course where Kepler’s laws were derived. The orbits fall into
two categories: elliptic if $E < 0$ and hyperbolic if $E > 0$ as shown in figure 15.


It’s worth noting the methodology we used to solve this problem. We started with 
6 degrees of freedom describing the positions of two particles. Eliminating the centre 
of mass reduced this to 3 degrees of freedom describing the separation. We then used 
conservation of the direction of $\mathbf{L}$ to reduce to 2 degrees of freedom ($r$ and $\phi$), and
conservation of the magnitude of L to reduce to a single variable r. Finally conservation 
of E allowed us to solve the problem. You might now be getting an idea about how 
important conservation laws are to help us solve problems! 
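
To make the role of the effective potential concrete, the sketch below locates its minimum numerically for the gravitational case, $V_{\rm eff}(r) = -k/r + J^2/(2\mu r^2)$ with $k$ standing for $Gm_1m_2$, and compares it with the radius $r_0 = J^2/(\mu k)$ obtained by setting $\partial V_{\rm eff}/\partial r = 0$ by hand. The ternary search, the bracketing interval and the parameter values are assumptions made for illustration.

```python
def v_eff(r, J, mu, k):
    """Effective potential for V(r) = -k / r, where k stands for G m1 m2."""
    return -k / r + J**2 / (2 * mu * r**2)

def circular_radius(J, mu, k, lo=1e-3, hi=1e3, iterations=200):
    """Locate the minimum of V_eff on [lo, hi] by ternary search."""
    for _ in range(iterations):
        a = lo + (hi - lo) / 3
        b = hi - (hi - lo) / 3
        if v_eff(a, J, mu, k) < v_eff(b, J, mu, k):
            hi = b      # the minimum lies to the left of b
        else:
            lo = a      # the minimum lies to the right of a
    return 0.5 * (lo + hi)

J, mu, k = 2.0, 0.5, 4.0
r0 = circular_radius(J, mu, k)
print(r0, J**2 / (mu * k), v_eff(r0, J, mu, k))
```

The energy at the minimum is negative, which is consistent with the circular orbit sitting at the bottom of the range of bound, elliptic orbits.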


2.5.5 Restricted Three Body Problem 


Consider three masses $m_1$, $m_2$ and $m_3$ interacting gravitationally. In general this
problem does not have an analytic solution and we must resort to numerical methods (i.e.
putting it on a computer). However, suppose that $m_3 \ll m_1$ and $m_2$. Then it is a good
approximation to first solve for the motion of $m_1$ and $m_2$ interacting alone, and then
solve for the motion of $m_3$ in the time dependent potential set up by $m_1$ and $m_2$. Let’s
see how this works.

For simplicity, let’s assume $m_1$ and $m_2$ are in a circular orbit with $\phi = \omega t$. We saw
in the previous section that the circular orbit occurs for $\partial V_{\rm eff}/\partial r = 0$, from which we
get an expression relating the angular velocity of the orbit to the distance

$$\omega^2 = \frac{G(m_1 + m_2)}{r^3} \tag{2.91}$$

which is a special case of Kepler’s third law. Let’s further assume that $m_3$ moves in the
same plane as $m_1$ and $m_2$ (which is a pretty good assumption for the sun-earth-moon
system). To solve for the motion of $m_3$ in this background, we use our ability to change
coordinates. Let’s go to a frame which rotates with $m_1$ and $m_2$ with the centre of
mass at the origin. The particle $m_1$ is a distance $r\mu/m_1$ from the origin, while $m_2$ is a
distance $r\mu/m_2$ from the origin.
Then, from the example of Section 2.2.1, the Lagrangian for $m_3$ in the rotating frame is

$$L = \tfrac{1}{2} m_3\left[(\dot{x} - \omega y)^2 + (\dot{y} + \omega x)^2\right] - V$$

where $V$ is the gravitational potential for $m_3$ interacting with $m_1$ and $m_2$

$$V = -\frac{Gm_1m_3}{r_{13}} - \frac{Gm_2m_3}{r_{23}} \tag{2.92}$$

Figure 16: (diagram of the rotating frame, with $m_1$ and $m_2$ on the $x$-axis at distances
$r\mu/m_1$ and $r\mu/m_2$ from the origin)

The separations are given by

$$r_{13}^2 = (x + r\mu/m_1)^2 + y^2 , \qquad r_{23}^2 = (x - r\mu/m_2)^2 + y^2 \tag{2.93}$$

Be aware that $x$ and $y$ are the dynamical coordinates in this system, while $r$ is the
fixed separation between $m_1$ and $m_2$. The equations of motion arising from $L$ are


$$\begin{aligned}
m_3\ddot{x} &= 2m_3\omega\dot{y} + m_3\omega^2 x - \frac{\partial V}{\partial x} \\
m_3\ddot{y} &= -2m_3\omega\dot{x} + m_3\omega^2 y - \frac{\partial V}{\partial y}
\end{aligned} \tag{2.94}$$

The full solutions to these equations are interesting and complicated. In fact, in 1889,
Poincaré studied the restricted three-body system and discovered the concept of chaos
in dynamical systems for the first time (and, in the process, won 2,500 krona and lost
3,500 krona). We’ll be a little less ambitious here and try to find solutions of the form
$\dot{x} = \dot{y} = 0$. This is where the third body sits stationary to the other two and the whole
system rotates together. Physically, the centrifugal force of the third body exactly
cancels its gravitational force. The equations we have to solve are

$$m_3\omega^2 x = \frac{\partial V}{\partial x} = Gm_1m_3\frac{x + r\mu/m_1}{r_{13}^3} + Gm_2m_3\frac{x - r\mu/m_2}{r_{23}^3} \tag{2.95}$$

$$m_3\omega^2 y = \frac{\partial V}{\partial y} = Gm_1m_3\frac{y}{r_{13}^3} + Gm_2m_3\frac{y}{r_{23}^3} \tag{2.96}$$

There are five solutions to these equations. Firstly suppose that $y = 0$ so that $m_3$ sits
on the same line as $m_1$ and $m_2$. Then we have to solve the algebraic equation

$$\omega^2 x = Gm_1\frac{x + r\mu/m_1}{|x + r\mu/m_1|^3} + Gm_2\frac{x - r\mu/m_2}{|x - r\mu/m_2|^3} \tag{2.97}$$


In figure 17, we have plotted the LHS and RHS of this equation to demonstrate the
three solutions, one in each of the regimes:

$$x < -\frac{r\mu}{m_1} , \qquad -\frac{r\mu}{m_1} < x < \frac{r\mu}{m_2} , \qquad x > \frac{r\mu}{m_2} \tag{2.98}$$

Figure 17: The three solutions sitting on $y = 0$.

Now let’s look for solutions with $y \neq 0$. From (2.96) we have

$$\frac{Gm_2}{r_{23}^3} = \omega^2 - \frac{Gm_1}{r_{13}^3} \tag{2.99}$$

which we can substitute into (2.95) and, after a little algebra, we find the condition for
solutions to be

$$\omega^2 = \frac{G(m_1 + m_2)}{r_{13}^3} = \frac{G(m_1 + m_2)}{r_{23}^3} \tag{2.100}$$

which means that we must have $r_{13} = r_{23} = r$. There are two such points.
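
The five stationary points are easy to locate numerically. The sketch below finds the three collinear points by bisection on the $y = 0$ equation, one root in each of the three regimes, and writes down the two equilateral points directly from $r_{13} = r_{23} = r$. It uses $\mu/m_1 = m_2/(m_1 + m_2)$ to place the two heavy bodies; the function name, the units $G = r = 1$ and the bracketing intervals are assumptions made for illustration.

```python
import math

def lagrange_points(m1, m2, r=1.0, G=1.0):
    """Return (three collinear x values, two equilateral (x, y) points) in the rotating frame."""
    M = m1 + m2
    omega2 = G * M / r**3                      # Kepler relation for the circular orbit
    x1, x2 = -r * m2 / M, r * m1 / M           # positions of m1 and m2 on the x-axis

    def residual(x):
        # Gravitational pull minus centrifugal term for a point on y = 0.
        d1, d2 = x - x1, x - x2
        return G * m1 * d1 / abs(d1)**3 + G * m2 * d2 / abs(d2)**3 - omega2 * x

    def bisect(lo, hi):
        for _ in range(200):
            mid = 0.5 * (lo + hi)
            if residual(lo) * residual(mid) <= 0:
                hi = mid
            else:
                lo = mid
        return 0.5 * (lo + hi)

    eps = 1e-9 * r                             # stay just off the singular points
    collinear = [bisect(x1 - 10 * r, x1 - eps),
                 bisect(x1 + eps, x2 - eps),
                 bisect(x2 + eps, x2 + 10 * r)]
    # Equilateral points: distance r from both m1 and m2.
    x_tri = 0.5 * (x1 + x2)
    y_tri = math.sqrt(3) / 2 * r
    return collinear, [(x_tri, y_tri), (x_tri, -y_tri)]

collinear, triangular = lagrange_points(1.0, 1.0)
print(collinear)
print(triangular)
```

For equal masses the middle collinear point sits at the centre of mass and the outer two are mirror images of each other, which is a useful check on the signs in the equations.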


In general there are five stationary points drawn in the figure. These are called
Lagrange points. It turns out that $L_1$, $L_2$ and $L_3$ are unstable, while $L_4$ and $L_5$ are
stable as long as $m_2$ is sufficiently less than $m_1$.

For the earth-sun system, NASA and ESA make use of the Lagrange points $L_2$ and $L_3$ to
place satellites. There are solar observatories at $L_3$; satellites such as WMAP and
PLANCK which measure the cosmic microwave background radiation (the afterglow of the
big bang) gather their data from $L_2$. Apparently, there is a large collection of cosmic
dust which has accumulated at $L_4$ and $L_5$. Other planetary systems (e.g. the sun-jupiter
and sun-mars systems) have large asteroids, known as trojans, trapped at their $L_4$ and $L_5$.

Figure 18: The five Lagrange points. X marks the spots.


2.5.6 Purely Kinetic Lagrangians 


Often in physics, one is interested in systems with only kinetic energy and no potential
energy. For a system with $n$ dynamical degrees of freedom $q^a$, $a = 1, \ldots, n$, the most
general form of the Lagrangian with just a kinetic term is

$$L = \tfrac{1}{2} g_{ab}(q_c)\,\dot{q}^a\dot{q}^b \tag{2.101}$$

The functions $g_{ab} = g_{ba}$ depend on all the generalised coordinates. Assume that
$\det(g_{ab}) \neq 0$ so that the inverse matrix $g^{ab}$ exists ($g^{ab}g_{bc} = \delta^a_c$). It is a short
exercise to show that Lagrange’s equations for this system are given by

$$\ddot{q}^a + \Gamma^a_{bc}\,\dot{q}^b\dot{q}^c = 0 \tag{2.102}$$

where

$$\Gamma^a_{bc} = \tfrac{1}{2} g^{ad}\left(\frac{\partial g_{bd}}{\partial q^c} + \frac{\partial g_{cd}}{\partial q^b} - \frac{\partial g_{bc}}{\partial q^d}\right) \tag{2.103}$$

The functions $g_{ab}$ define a metric on the configuration space, and the equations (2.102)
are known as the geodesic equations. They appear naturally in General Relativity
where they describe a particle moving in curved spacetime. Lagrangians of the form
(2.101) also appear in many other areas of physics, including condensed matter
physics, the theory of nuclear forces and string theory. In these contexts, the systems
are referred to as sigma models.
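
As a small worked example of the geodesic equations, the sketch below computes $\Gamma^a_{bc} = \tfrac{1}{2}g^{ad}(\partial_c g_{bd} + \partial_b g_{cd} - \partial_d g_{bc})$ by central differences for the metric of a free particle of unit mass in plane polar coordinates, $g = \mathrm{diag}(1, r^2)$. The helper names, the restriction to two coordinates and the choice of metric are assumptions made to keep the example short.

```python
def christoffel(metric, q, h=1e-5):
    """Gamma[a][b][c] for a two-coordinate metric, using central differences."""
    n = len(q)

    def inverse_2x2(g):
        det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
        return [[g[1][1] / det, -g[0][1] / det], [-g[1][0] / det, g[0][0] / det]]

    def d_metric(c):
        # Partial derivative of every g_ab with respect to q^c.
        up = [qi + (h if i == c else 0.0) for i, qi in enumerate(q)]
        dn = [qi - (h if i == c else 0.0) for i, qi in enumerate(q)]
        gu, gd = metric(up), metric(dn)
        return [[(gu[a][b] - gd[a][b]) / (2 * h) for b in range(n)] for a in range(n)]

    dg = [d_metric(c) for c in range(n)]       # dg[c][a][b] = d g_ab / d q^c
    g_inv = inverse_2x2(metric(q))
    return [[[0.5 * sum(g_inv[a][d] * (dg[c][b][d] + dg[b][c][d] - dg[d][b][c])
                        for d in range(n))
              for c in range(n)] for b in range(n)] for a in range(n)]

def polar_metric(q):
    r, phi = q
    return [[1.0, 0.0], [0.0, r**2]]

gamma = christoffel(polar_metric, [2.0, 0.3])
print(gamma[0][1][1], gamma[1][0][1])
```

With coordinates ordered as $(r, \phi)$, the non-zero symbols are $\Gamma^r_{\phi\phi} = -r$ and $\Gamma^\phi_{r\phi} = \Gamma^\phi_{\phi r} = 1/r$, so the geodesic equations reduce to $\ddot{r} - r\dot{\phi}^2 = 0$ and $\ddot{\phi} + 2\dot{r}\dot{\phi}/r = 0$: straight-line motion written in polar coordinates.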


2.5.7 Particles in Electromagnetic Fields 


We saw from the beginning that the Lagrangian formulation works with conservative
forces which can be written in terms of a potential. It is no good at dealing with friction
forces which are often of the type $F = -k\dot{x}$. But there are other velocity dependent
forces which arise in the fundamental laws of Nature. It’s a crucial fact about Nature
that all of these can be written in Lagrangian form. Let’s illustrate this in an important
example.

Recall that the electric field $\mathbf{E}$ and the magnetic field $\mathbf{B}$ can be written in terms of
a vector potential $\mathbf{A}(\mathbf{r}, t)$ and a scalar potential $\phi(\mathbf{r}, t)$

$$\mathbf{B} = \nabla \times \mathbf{A} , \qquad \mathbf{E} = -\nabla\phi - \frac{\partial \mathbf{A}}{\partial t} \tag{2.104}$$

Let’s study the Lagrangian for a particle of electric charge $e$ of the form,

$$L = \tfrac{1}{2} m\dot{\mathbf{r}}^2 - e\left(\phi - \dot{\mathbf{r}} \cdot \mathbf{A}\right) \tag{2.105}$$
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The momentum conjugate to r is 
OL 
= — = mt A 2.106 
p=5 =m +e ( ) 
Notice that the momentum is not simply mr; it’s modified in the presence of electric 
and magnetic fields. Now we can calculate Lagrange’s equations 


d (OL OL d, ., ’ 
Ti ($) = (mr +eA)+eVọ-—-eV(tr-A)=0 (2.107) 


To disentangle this, let’s work with indices a,b = 1,2,3 on the Cartesian coordinates, 
and rewrite the equation of motion as 


P Op OA, OA, OA,\., 
a = 2.1 
A (3 OF ) ve G mt) a 
Now we use our definitions of the E and B fields (2.104) which, in terms of indices, 
read 
Op Aa A, 
= = = —— .1 
E, dra Ət , B. Ecab dra (2 09) 


The last of these can be equivalently written as 0A,/Or* — OA,/Or° = €qtcBe. The 
equation of motion then reads 


mi = eE, + Eca?’ Be 
or, reverting to vector notation, 
mr =e(E+r xB) (2.110) 
which is the Lorentz force law. 
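
To see (2.110) in action, the sketch below integrates the Lorentz force law for uniform fields with a classical fourth-order Runge-Kutta step. It is a minimal illustration only: the function names, the units $e = m = 1$ and the choice of a uniform magnetic field along the third axis are all assumptions made for the example.

```python
import math

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def lorentz_acceleration(v, E, B, e, m):
    """Acceleration from (2.110): m r'' = e (E + r' x B)."""
    vxB = cross(v, B)
    return [e * (E[i] + vxB[i]) / m for i in range(3)]

def rk4_step(r, v, dt, E, B, e, m):
    """One RK4 step; for uniform fields the acceleration depends only on v."""
    def shifted(base, slope, h):
        return [base[i] + h * slope[i] for i in range(3)]
    a1 = lorentz_acceleration(v, E, B, e, m)
    v2 = shifted(v, a1, 0.5 * dt)
    a2 = lorentz_acceleration(v2, E, B, e, m)
    v3 = shifted(v, a2, 0.5 * dt)
    a3 = lorentz_acceleration(v3, E, B, e, m)
    v4 = shifted(v, a3, dt)
    a4 = lorentz_acceleration(v4, E, B, e, m)
    r_new = [r[i] + dt * (v[i] + 2 * v2[i] + 2 * v3[i] + v4[i]) / 6 for i in range(3)]
    v_new = [v[i] + dt * (a1[i] + 2 * a2[i] + 2 * a3[i] + a4[i]) / 6 for i in range(3)]
    return r_new, v_new

def simulate(r, v, t_final, steps, E, B, e=1.0, m=1.0):
    dt = t_final / steps
    for _ in range(steps):
        r, v = rk4_step(r, v, dt, E, B, e, m)
    return r, v

# One cyclotron period 2*pi*m/(e*B) in a pure magnetic field
r_end, v_end = simulate([0.0, 0.0, 0.0], [1.0, 0.0, 0.0],
                        2 * math.pi, 2000, E=[0.0, 0.0, 0.0], B=[0.0, 0.0, 1.0])
print(r_end, v_end)
```

In a pure magnetic field the force is always perpendicular to the velocity, so the speed should stay fixed and the particle should return to its starting point after one period, which is what the run above is set up to probe.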


Gauge Invariance: The scalar and vector potentials are not unique. We may make a change of the form

$$\phi \to \phi - \frac{\partial \chi}{\partial t} \ , \quad \mathbf{A} \to \mathbf{A} + \nabla\chi \qquad (2.111)$$

These give the same $\mathbf{E}$ and $\mathbf{B}$ fields for any function $\chi$. This is known as a gauge transformation. Under this change, we have

$$L \to L + e\frac{\partial \chi}{\partial t} + e\,\dot{\mathbf{r}}\cdot\nabla\chi = L + e\frac{d\chi}{dt} \qquad (2.112)$$

but we know that the equations of motion remain invariant under the addition of a total derivative to the Lagrangian. This concept of gauge invariance underpins much of modern physics. You can learn more about this in the lectures on Electromagnetism.

2.6 Small Oscillations and Stability

“Physics is that subset of human experience which can be reduced to coupled harmonic oscillators”
Michael Peskin


Peskin doesn’t say this to knock physics. He’s just a fan of harmonic oscillators. And 
rightly so. By studying the simple harmonic oscillator and its relatives in ever more 
inventive ways we understand why the stars shine and why lasers shine and, thanks to 
Hawking, why even black holes shine. 


In this section we’ll see one reason why the simple harmonic oscillator is so important 
to us. We will study the motion of systems close to equilibrium and see that the 
dynamics is described by n decoupled simple harmonic oscillators, each ringing at a 
different frequency. 


Let’s start with a single degree of freedom $x$. We’ve already seen several examples where we get an equation of the form

$$\ddot{x} = f(x) \qquad (2.113)$$

An equilibrium point, $x = x_0$, of this system satisfies $f(x_0) = 0$. This means that if we start with the initial conditions

$$x = x_0 \quad \text{and} \quad \dot{x} = 0 \qquad (2.114)$$

then the system will stay there forever. But what if we start slightly away from $x = x_0$? To analyse this, we write

$$x(t) = x_0 + \eta(t) \qquad (2.115)$$

where $\eta$ is assumed to be small so that we can Taylor expand $f(x)$ to find

$$\ddot{\eta} = f'(x_0)\,\eta + O(\eta^2) \qquad (2.116)$$

and we neglect the terms quadratic in $\eta$ and higher. There are two possible behaviours of this system

1. $f'(x_0) < 0$. In this case the restoring force sends us back to $\eta = 0$ and the solution is

$$\eta(t) = A\cos(\omega(t - t_0)) \qquad (2.117)$$

where $A$ and $t_0$ are integration constants, while $\omega^2 = -f'(x_0)$. The system undergoes stable oscillations about $x = x_0$ at frequency $\omega$.

2. $f'(x_0) > 0$. In this case, the force pushes us away from equilibrium and the solution is

$$\eta(t) = Ae^{\lambda t} + Be^{-\lambda t} \qquad (2.118)$$

where $A$ and $B$ are integration constants, while $\lambda^2 = f'(x_0)$. In this case, there is a very special initial condition $A = 0$ such that $x \to x_0$ at late times. But for generic initial conditions, $\eta$ gets rapidly large and the approximation that $\eta$ is small breaks down. We say the system has a linear instability.
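
The two cases above can be sketched as a small classifier that estimates $f'(x_0)$ numerically. This is an illustrative sketch only: the function name `classify_equilibrium` and the pendulum-like test force $f(x) = -4\sin x$ are assumptions chosen for the example.

```python
import math

def classify_equilibrium(f, x0, h=1e-5):
    """Classify an equilibrium of x'' = f(x) using the sign of f'(x0).

    Returns ("stable", omega) with omega^2 = -f'(x0), or
    ("unstable", lam) with lam^2 = f'(x0), following (2.117) and (2.118).
    """
    fprime = (f(x0 + h) - f(x0 - h)) / (2 * h)  # central difference
    if fprime < 0:
        return "stable", math.sqrt(-fprime)
    if fprime > 0:
        return "unstable", math.sqrt(fprime)
    return "marginal", 0.0  # the linear analysis is inconclusive

def force(x):
    # Assumed example: a pendulum with g/l = 4, so x'' = -4 sin(x)
    return -4.0 * math.sin(x)

print(classify_equilibrium(force, 0.0))      # hanging straight down
print(classify_equilibrium(force, math.pi))  # balanced upside down
```

The hanging position falls under case 1 and the inverted position under case 2, with the same rate $\sqrt{g/l}$ setting the oscillation frequency in one case and the exponential growth in the other.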


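Now let’s generalise this discussion to $n$ degrees of freedom with equations of motion of the form,

$$\ddot{q}_i = f_i(q_1, \ldots, q_n) \qquad i = 1,\ldots,n \qquad (2.119)$$

An equilibrium point $q_i^0$ must satisfy $f_i(q_1^0,\ldots,q_n^0) = 0$ for all $i = 1,\ldots,n$. Consider small perturbations away from the equilibrium point

$$q_i(t) = q_i^0 + \eta_i(t) \qquad (2.120)$$

where, again, we take the $\eta_i$ to be small so that we can Taylor expand the $f_i$, and neglect the quadratic terms and higher. We have

$$\ddot{\eta}_i \approx \left.\frac{\partial f_i}{\partial q_j}\right|_{q_k = q_k^0} \eta_j \qquad (2.121)$$

where the sum over $j = 1,\ldots,n$ is implicit. It’s useful to write this in matrix form. We define the vector $\eta$ and the $n \times n$ matrix $F$ as

$$\eta = \begin{pmatrix} \eta_1 \\ \vdots \\ \eta_n \end{pmatrix} \ , \quad F = \begin{pmatrix} \dfrac{\partial f_1}{\partial q_1} & \cdots & \dfrac{\partial f_1}{\partial q_n} \\ \vdots & & \vdots \\ \dfrac{\partial f_n}{\partial q_1} & \cdots & \dfrac{\partial f_n}{\partial q_n} \end{pmatrix} \qquad (2.122)$$

where each partial derivative in the matrix $F$ is evaluated at $q_i = q_i^0$. The equation now becomes simply

$$\ddot{\eta} = F\eta \qquad (2.123)$$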


Our strategy is simple: we search for eigenvectors of $F$. If $F$ were a symmetric matrix, it would have a complete set of orthogonal eigenvectors with real eigenvalues. Unfortunately, we can’t assume that $F$ is symmetric. Nonetheless, it is true that for equations of the form (2.123) arising from physical Lagrangian systems, the eigenvalues will be real. We shall postpone a proof of this fact for a couple of paragraphs and continue under the assumption that $F$ has real eigenvalues. In general, $F$ will have different left and right eigenvectors,

$$F\mu_a = \lambda_a^2\,\mu_a \ , \quad \zeta_a^T F = \lambda_a^2\,\zeta_a^T \qquad a = 1,\ldots,n \qquad (2.124)$$

where there’s no sum over $a$ in these equations. The left and right eigenvectors satisfy $\zeta_a \cdot \mu_b = \delta_{ab}$. Note that although the eigenvectors differ, the eigenvalues $\lambda_a^2$ for $a = 1,\ldots,n$ are the same. Although the $\lambda_a^2$ are real for the physical systems of interest (to be proved shortly) they are not always positive. The most general solution to $\ddot{\eta} = F\eta$ is

$$\eta(t) = \sum_a \mu_a\left[A_a e^{\lambda_a t} + B_a e^{-\lambda_a t}\right] \qquad (2.125)$$

where $A_a$ and $B_a$ are $2n$ integration constants. Again, we have two possibilities for each eigenvalue

1. $\lambda_a^2 < 0$. In this case $\pm\lambda_a = \pm i\omega_a$ for some real number $\omega_a$. The system will be stable in the corresponding direction $\eta = \mu_a$.

2. $\lambda_a^2 > 0$. Now $\pm\lambda_a$ are real and the system exhibits a linear instability in the direction $\eta = \mu_a$.

The eigenvectors $\mu_a$ are called normal modes. The equilibrium point is only stable if $\lambda_a^2 < 0$ for every $a = 1,\ldots,n$. If this is the case the system will typically oscillate around the equilibrium point as a linear superposition of all the normal modes, each at a different frequency.

To keep things real, we can write the most general solution as

$$\eta(t) = \sum_{a,\,\lambda_a^2 > 0} \mu_a\left[A_a e^{\lambda_a t} + B_a e^{-\lambda_a t}\right] + \sum_{a,\,\lambda_a^2 < 0} \mu_a A_a\cos(\omega_a(t - t_a)) \qquad (2.126)$$

where now $A_a$, $B_a$ and $t_a$ are the $2n$ integration constants.


The Reality of the Eigenvalues 


Finally, let’s show what we put off above: that the eigenvalues $\lambda_a^2$ are real for matrices $F$ derived from a physical Lagrangian system. Consider a general Lagrangian of the form,

$$L = \frac{1}{2} T_{ij}(q)\,\dot{q}_i\dot{q}_j - V(q) \qquad (2.127)$$

We will require that $T_{ij}(q)$ is invertible and positive definite for all $q$. Expanding about an equilibrium point as in (2.120), to linear order in $\eta_i$ the equations read

$$T_{ij}\,\ddot{\eta}_j = -V_{ij}\,\eta_j \qquad (2.128)$$

where $T_{ij} = T_{ij}(q^0)$ and $V_{ij} = \partial^2 V/\partial q_i\partial q_j$, evaluated at $q_i = q_i^0$. Then in the matrix notation of (2.123), we have $F = -T^{-1}V$. Both $T_{ij}$ and $V_{ij}$ are symmetric, but not necessarily simultaneously diagonalisable. This means that $F_{ij}$ is not necessarily symmetric. Nevertheless, $F$ does have real eigenvalues. To see this, look at

$$F\mu = \lambda^2\mu \quad \Rightarrow \quad V\mu = -\lambda^2\,T\mu \qquad (2.129)$$

So far, both $\mu$ and $\lambda^2$ could be complex. We will now show that they’re not. Take the inner product of this equation with the complex conjugate eigenvector $\bar{\mu}$. We have $\bar{\mu}\cdot V\mu = -\lambda^2\,\bar{\mu}\cdot T\mu$. But for any real symmetric matrix $S$, the quantity $\bar{\mu}\cdot S\mu$ is real. (This follows from expanding $\mu$ in the complete set of real, orthogonal eigenvectors of $S$, each of which has a real eigenvalue). Therefore $\bar{\mu}\cdot V\mu$ and $\bar{\mu}\cdot T\mu$ are both real. Since we have assumed that $T$ is invertible and positive definite, we know that $\bar{\mu}\cdot T\mu \neq 0$ so, from (2.129), we conclude that the eigenvalue $\lambda^2$ is indeed real.


2.6.1 Example: The Double Pendulum 


In section 2.5.2, we derived the Lagrangian for the double pendulum. Restricting to the case where the two masses are the same $m_1 = m_2 = m$ and the two lengths are the same $l_1 = l_2 = l$, we derived the Lagrangian (2.72) for arbitrary oscillations

$$L = ml^2\dot{\theta}_1^2 + \frac{1}{2}ml^2\dot{\theta}_2^2 + ml^2\cos(\theta_1 - \theta_2)\,\dot{\theta}_1\dot{\theta}_2 + 2mgl\cos\theta_1 + mgl\cos\theta_2$$

The stable equilibrium point is clearly $\theta_1 = \theta_2 = 0$. (You could check mathematically if you’re dubious). Let’s expand for small $\theta_1$ and $\theta_2$. If we want to linearise the equations of motion for $\theta$, then we must expand the Lagrangian to second order (so that after we take derivatives, there’s still a $\theta$ left standing). We have

$$L \approx ml^2\dot{\theta}_1^2 + \frac{1}{2}ml^2\dot{\theta}_2^2 + ml^2\dot{\theta}_1\dot{\theta}_2 - mgl\,\theta_1^2 - \frac{1}{2}mgl\,\theta_2^2 \qquad (2.130)$$

where we’ve thrown away an irrelevant constant. From this we can use Lagrange’s equations to derive the two linearised equations of motion

$$2ml^2\ddot{\theta}_1 + ml^2\ddot{\theta}_2 = -2mgl\,\theta_1$$
$$ml^2\ddot{\theta}_1 + ml^2\ddot{\theta}_2 = -mgl\,\theta_2 \qquad (2.131)$$

Or, writing $\theta = (\theta_1, \theta_2)^T$, this becomes

$$\ddot{\theta} = -\frac{g}{l}\begin{pmatrix} 2 & -1 \\ -2 & 2 \end{pmatrix}\theta \qquad (2.132)$$

Figure 19: (panels: Normal Mode 1, Normal Mode 2)

We have two eigenvectors. They are

1. $\mu_1 = (1, \sqrt{2})^T$ which has eigenvalue $\lambda_1^2 = -(g/l)(2 - \sqrt{2})$. This corresponds to the motion shown in figure 19 for the first normal mode.

2. $\mu_2 = (1, -\sqrt{2})^T$ which has eigenvalue $\lambda_2^2 = -(g/l)(2 + \sqrt{2})$. This corresponds to the motion shown in figure 19 for the second normal mode.


We see that the frequency of the mode in which the two rods oscillate in different 
directions should be higher than that in which they oscillate together. 
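
The eigenvalues quoted above can be checked directly from $F = -T^{-1}V$. The sketch below reads the matrices $T$ and $V$ off the quadratic Lagrangian (2.130) and works in units where $g/l = 1$ by default; the helper names are assumptions introduced for the example.

```python
import math

def inverse_2x2(M):
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    return [[M[1][1] / det, -M[0][1] / det],
            [-M[1][0] / det, M[0][0] / det]]

def matmul_2x2(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def eigenvalues_2x2(M):
    """Real eigenvalues of a 2x2 matrix from its trace and determinant."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    root = math.sqrt(tr * tr - 4 * det)
    return (tr + root) / 2, (tr - root) / 2

def double_pendulum_F(g_over_l=1.0):
    """F = -T^{-1} V, with T and V read off from (2.130)."""
    T = [[2.0, 1.0], [1.0, 1.0]]  # kinetic matrix in units of m l^2
    V = [[2.0, 0.0], [0.0, 1.0]]  # potential matrix in units of m g l
    TinvV = matmul_2x2(inverse_2x2(T), V)
    return [[-g_over_l * TinvV[i][j] for j in range(2)] for i in range(2)]

F = double_pendulum_F()
lam1_sq, lam2_sq = eigenvalues_2x2(F)
print(F)
print(lam1_sq, -(2 - math.sqrt(2)))
print(lam2_sq, -(2 + math.sqrt(2)))
```

Both eigenvalues are negative, so both normal modes are stable oscillations, with frequencies $\omega_a = \sqrt{-\lambda_a^2}$.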


2.6.2 Example: The Linear Triatomic Molecule 


Consider the molecule drawn in the figure. It’s a rough approximation of CO$_2$. We’ll only consider motion in the direction parallel to the molecule for each atom, in which case the Lagrangian for this object takes the form,

Figure 20:

$$L = \frac{1}{2}m\dot{x}_1^2 + \frac{1}{2}M\dot{x}_2^2 + \frac{1}{2}m\dot{x}_3^2 - V(x_1 - x_2) - V(x_2 - x_3) \qquad (2.133)$$

The function $V$ is some rather complicated interatomic potential. But, the point of this section is that if we’re interested in oscillations around equilibrium, this doesn’t matter. Assume that $x_i = x_i^0$ in equilibrium. By symmetry, we have $|x_1^0 - x_2^0| = |x_2^0 - x_3^0| = r_0$. We write deviations from equilibrium as

$$x_i(t) = x_i^0 + \eta_i(t) \qquad (2.134)$$

Figure 21: The three normal modes of the triatomic molecule (panels: Normal Mode 1, Normal Mode 2, Normal Mode 3)


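Taylor expanding the potential about the equilibrium point,

$$V(r) = V(r_0) + \left.\frac{\partial V}{\partial r}\right|_{r = r_0}(r - r_0) + \frac{1}{2}\left.\frac{\partial^2 V}{\partial r^2}\right|_{r = r_0}(r - r_0)^2 + \ldots \qquad (2.135)$$

Here the first term $V(r_0)$ is a constant and can be ignored, while the second term $\partial V/\partial r$ vanishes since we are in equilibrium. Substituting into the Lagrangian, we have

$$L \approx \frac{1}{2}m\dot{\eta}_1^2 + \frac{1}{2}M\dot{\eta}_2^2 + \frac{1}{2}m\dot{\eta}_3^2 - \frac{k}{2}\left[(\eta_1 - \eta_2)^2 + (\eta_2 - \eta_3)^2\right] \qquad (2.136)$$

where $k = \partial^2 V/\partial r^2$ evaluated at $r = r_0$. Then the equations of motion are

$$\begin{pmatrix} m\ddot{\eta}_1 \\ M\ddot{\eta}_2 \\ m\ddot{\eta}_3 \end{pmatrix} = -k\begin{pmatrix} \eta_1 - \eta_2 \\ (\eta_2 - \eta_1) + (\eta_2 - \eta_3) \\ \eta_3 - \eta_2 \end{pmatrix} \qquad (2.137)$$

or, putting it in the form $\ddot{\eta} = F\eta$, we have

$$F = \begin{pmatrix} -k/m & k/m & 0 \\ k/M & -2k/M & k/M \\ 0 & k/m & -k/m \end{pmatrix} \qquad (2.138)$$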


Again, we must look for eigenvectors of $F$. There are three:

1. $\mu_1 = (1, 1, 1)^T$ which has eigenvalue $\lambda_1^2 = 0$. But this is just an overall translation of the molecule. It’s not an oscillation.

2. $\mu_2 = (1, 0, -1)^T$ which has eigenvalue $\lambda_2^2 = -k/m$. In this motion, the outer two atoms oscillate out of phase, while the middle atom remains stationary. The oscillation has frequency $\omega_2 = \sqrt{k/m}$.

3. $\mu_3 = (1, -2m/M, 1)^T$ which has eigenvalue $\lambda_3^2 = -(k/m)(1 + 2m/M)$. This oscillation is a little less obvious. The two outer atoms move in the same direction, while the middle atom moves in the opposite direction. The frequency of this vibration $\omega_3 = \sqrt{-\lambda_3^2}$ is greater than that of the second normal mode.

All motions are drawn in figure 21. For small deviations from equilibrium, the most general motion is a superposition of all of these modes.

$$\eta(t) = \mu_1(A + Bt) + \mu_2\,C\cos(\omega_2(t - t_2)) + \mu_3\,D\cos(\omega_3(t - t_3)) \qquad (2.139)$$
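
The three eigenpairs listed above can be verified by multiplying each vector by the matrix (2.138). The following sketch does this in plain Python; the numerical values $k = 1$, $m = 16$, $M = 12$ (loosely motivated by oxygen and carbon mass numbers) and the helper names are assumptions made only for illustration.

```python
def triatomic_F(k, m, M):
    """The matrix F of (2.138)."""
    return [[-k / m, k / m, 0.0],
            [k / M, -2 * k / M, k / M],
            [0.0, k / m, -k / m]]

def mat_vec(F, v):
    return [sum(F[i][j] * v[j] for j in range(3)) for i in range(3)]

def is_eigenpair(F, v, lam_sq, tol=1e-9):
    """Check F v = lam_sq * v component by component."""
    Fv = mat_vec(F, v)
    return all(abs(Fv[i] - lam_sq * v[i]) < tol for i in range(3))

k, m, M = 1.0, 16.0, 12.0  # assumed illustrative values
F = triatomic_F(k, m, M)
modes = [
    ([1.0, 1.0, 1.0], 0.0),                                 # translation
    ([1.0, 0.0, -1.0], -k / m),                             # outer atoms out of phase
    ([1.0, -2 * m / M, 1.0], -(k / m) * (1 + 2 * m / M)),   # middle atom opposes
]
for vector, lam_sq in modes:
    print(vector, lam_sq, is_eigenpair(F, vector, lam_sq))
```

Because the check is purely algebraic, any positive values of $k$, $m$ and $M$ should work equally well.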
3. The Motion of Rigid Bodies


Figure 22: Wolfgang Pauli and Niels Bohr stare in wonder at a spinning top. 


Having now mastered the technique of Lagrangians, this section will be one big 
application of the methods. The systems we will consider are the spinning motions of 
extended objects. As we shall see, these can often be counterintuitive. Certainly Pauli 
and Bohr found themselves amazed! 


We shall consider extended objects that don’t have any internal degrees of freedom. These are called “rigid bodies”, defined to be a collection of $N$ points constrained so that the distance between the points is fixed, i.e.

$$|\mathbf{r}_i - \mathbf{r}_j| = \text{constant} \qquad (3.1)$$

for all $i,j = 1,\ldots,N$. A simple example is a dumbbell (two masses connected by a light rod), or the pyramid drawn in the figure. In both cases, the distances between the masses are fixed.

Figure 23:

Often we will work with continuous, rather than discrete, bodies simply by replacing $\sum_i m_i \to \int d\mathbf{r}\,\rho(\mathbf{r})$ where $\rho(\mathbf{r})$ is the density of the object. A rigid body has six degrees of freedom


3 Translation + 3 Rotation 


The most general motion of a free rigid body is a translation plus a rotation about 
some point P. In this section we shall develop the techniques required to describe this 
motion. 


3.1 Kinematics 


Consider a body fixed at a point $P$. The most general allowed motion is a rotation about $P$. To describe this, we specify positions in a fixed space frame $\{\tilde{\mathbf{e}}_a\}$ by embedding a moving body frame $\{\mathbf{e}_a\}$ in the body so that $\{\mathbf{e}_a\}$ moves with the body.

Figure 24: The fixed space frame and the moving body frame.

Both sets of axes are orthonormal, so we have

$$\tilde{\mathbf{e}}_a\cdot\tilde{\mathbf{e}}_b = \delta_{ab} \ , \quad \mathbf{e}_a(t)\cdot\mathbf{e}_b(t) = \delta_{ab} \qquad (3.2)$$

We will soon see that there is a natural choice of the basis $\{\mathbf{e}_a\}$ in the body.

Claim: For all $t$, there exists a unique orthogonal matrix $R(t)$ with components $R_{ab}(t)$ such that $\mathbf{e}_a(t) = R_{ab}(t)\,\tilde{\mathbf{e}}_b$.

Proof: $\mathbf{e}_a\cdot\mathbf{e}_b = \delta_{ab} \Rightarrow R_{ac}R_{bd}\,\tilde{\mathbf{e}}_c\cdot\tilde{\mathbf{e}}_d = \delta_{ab} \Rightarrow R_{ac}R_{bc} = \delta_{ab}$ or, in other words, $(RR^T)_{ab} = \delta_{ab}$ which is the statement that $R$ is orthogonal. The uniqueness of $R$ follows by construction: $R_{ab} = \mathbf{e}_a\cdot\tilde{\mathbf{e}}_b$.
So as the rigid body rotates it is described by a time dependent orthogonal $3\times 3$ matrix $R(t)$. This matrix also has the property that its determinant is 1. (The other possibility is that its determinant is $-1$ which corresponds to a rotation and a reflection $\mathbf{e}_a \to -\mathbf{e}_a$). Conversely, every one-parameter family $R(t)$ describes a possible motion of the body. We have

C = Configuration Space = Space of $3\times 3$ Special Orthogonal Matrices = SO(3)

A $3\times 3$ matrix has 9 components but the condition of orthogonality $R^TR = 1$ imposes 6 relations, so the configuration space $C$ is 3 dimensional and we need 3 generalised coordinates to parameterise $C$. We shall describe a useful choice of coordinates, known as Euler angles, in section 3.5.


3.1.1 Angular Velocity 


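Any point $\mathbf{r}$ in the body can be expanded in either the space frame or the body frame:

$$\begin{aligned} \mathbf{r}(t) &= \tilde{r}_a(t)\,\tilde{\mathbf{e}}_a && \text{in the space frame} \\ &= r_a\,\mathbf{e}_a(t) && \text{in the body frame} \end{aligned} \qquad (3.3)$$

where $\tilde{r}_b(t) = r_a R_{ab}(t)$. Taking the time derivative, we have

$$\begin{aligned} \frac{d\mathbf{r}}{dt} &= \frac{d\tilde{r}_a}{dt}\,\tilde{\mathbf{e}}_a && \text{in the space frame} \\ &= r_a\,\frac{d\mathbf{e}_a(t)}{dt} && \text{in the body frame} \\ &= r_a\,\frac{dR_{ab}}{dt}\,\tilde{\mathbf{e}}_b \end{aligned} \qquad (3.4)$$

Alternatively, we can ask how the body frame basis itself changes with time,

$$\frac{d\mathbf{e}_a}{dt} = \frac{dR_{ab}}{dt}\,\tilde{\mathbf{e}}_b = \left(\frac{dR_{ab}}{dt}\,(R^{-1})_{bc}\right)\mathbf{e}_c \equiv \omega_{ac}\,\mathbf{e}_c \qquad (3.5)$$

where, in the last equality, we have defined $\omega_{ac} = \dot{R}_{ab}(R^{-1})_{bc} = \dot{R}_{ab}R_{cb}$ using the fact that $R^TR = 1$.

Claim: $\omega_{ac} = -\omega_{ca}$, i.e. $\omega$ is antisymmetric.

Proof: $R_{ab}R_{cb} = \delta_{ac} \Rightarrow \dot{R}_{ab}R_{cb} + R_{ab}\dot{R}_{cb} = 0 \Rightarrow \omega_{ac} + \omega_{ca} = 0$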
Since $\omega_{ac}$ is antisymmetric, we can use it to define an object with a single index (which we will also call $\omega$) using the formula

$$\omega_a = \frac{1}{2}\epsilon_{abc}\,\omega_{bc} \qquad (3.6)$$

so that $\omega_3 = \omega_{12}$ and so on. We treat these $\omega_a$ as the components of a vector in the body frame, so $\boldsymbol{\omega} = \omega_a\mathbf{e}_a$. Then finally we have our result for the change of the body frame basis with time

$$\frac{d\mathbf{e}_a}{dt} = -\epsilon_{abc}\,\omega_b\,\mathbf{e}_c = \boldsymbol{\omega}\times\mathbf{e}_a \qquad (3.7)$$

where, in the second equality, we have used the fact that our body frame axis has a “right-handed” orientation, meaning $\mathbf{e}_a\times\mathbf{e}_b = \epsilon_{abc}\,\mathbf{e}_c$. The vector $\boldsymbol{\omega}$ is called the instantaneous angular velocity and its components $\omega_a$ are measured with respect to the body frame.


Since the above discussion was a little formal, let’s draw a picture to uncover the physical meaning of $\boldsymbol{\omega}$. Consider a displacement of a given point $\mathbf{r}$ in the body by rotating an infinitesimal amount $d\phi$ about an axis $\hat{\mathbf{n}}$. From the figure, we see that $|d\mathbf{r}| = |\mathbf{r}|\,d\phi\,\sin\theta$. Moreover, this displacement is perpendicular to $\mathbf{r}$ since the distance to $P$ is fixed by the definition of a rigid body. So we have

$$d\mathbf{r} = d\boldsymbol{\phi}\times\mathbf{r} \qquad (3.8)$$

with $d\boldsymbol{\phi} = \hat{\mathbf{n}}\,d\phi$. “Dividing” this equation by $dt$, we have the result

Figure 25:

$$\dot{\mathbf{r}} = \boldsymbol{\omega}\times\mathbf{r} \qquad (3.9)$$

where $\boldsymbol{\omega} = d\boldsymbol{\phi}/dt$ is the instantaneous angular velocity. In general, both the axis of rotation $\hat{\mathbf{n}}$ and the rate of rotation $d\phi/dt$ will change over time.


Aside: One could define a slightly different type of angular velocity by looking at how the space frame coordinates $\tilde{r}_a(t)$ change with time, rather than the body frame axes $\mathbf{e}_a$. Since we have $\tilde{r}_b(t) = r_a R_{ab}(t)$, performing the same steps as above, we have

$$\dot{\tilde{r}}_b = r_a\dot{R}_{ab} = \tilde{r}_c\,(R^{-1}\dot{R})_{cb} \qquad (3.10)$$

which tempts us to define a different type of angular velocity, sometimes referred to as “convective angular velocity”, by $\Omega_{ab} = (R^{-1})_{ac}\dot{R}_{cb}$ which has the $R^{-1}$ and $\dot{R}$ in a different order. Throughout our discussion of rigid body motion, we will only deal with the original $\omega = \dot{R}R^{-1}$.
3.1.2 Path Ordered Exponentials

In the remainder of this chapter, we will devote much effort to determining the angular velocity vector $\boldsymbol{\omega}(t)$ of various objects as they spin and turn. But how do we go from this to the rotation $R(t)$? As described above, we first turn the vector $\boldsymbol{\omega} = \omega_a\mathbf{e}_a$ into a $3\times 3$ antisymmetric matrix $\omega_{ab} = \epsilon_{abc}\,\omega_c$. Then, from this, we get the rotation matrix $R$ by solving the differential equation

$$\frac{dR}{dt} = \omega R \qquad (3.11)$$

If $\omega$ and $R$ were scalar functions of time, then we could simply integrate this equation to get the solution

$$R(t) = \exp\left(\int_0^t \omega(t')\,dt'\right) \qquad (3.12)$$

which satisfies the initial condition $R(0) = 1$. But things are more complicated because both $\omega$ and $R$ are matrices. Let’s first describe how we take the exponential of a matrix. This is defined by the Taylor expansion. For any matrix $M$, we have

$$\exp(M) = 1 + M + \frac{1}{2}M^2 + \ldots \qquad (3.13)$$

As our first guess for the solution to the matrix equation (3.11), we could try the scalar solution (3.12) and look at what goes wrong. If we take the time derivative of the various terms in the Taylor expansion of this putative solution, then problems first arise when we hit the $\frac{1}{2}M^2$ type term. The time derivative of this reads

$$\frac{1}{2}\frac{d}{dt}\left(\int_0^t \omega(t')\,dt'\right)^2 = \frac{1}{2}\,\omega(t)\left(\int_0^t \omega(t')\,dt'\right) + \frac{1}{2}\left(\int_0^t \omega(t')\,dt'\right)\omega(t) \qquad (3.14)$$

We would like to show that $\dot{R} = \omega R$. The first term on the right-hand side looks good since it appears in the Taylor expansion of $\omega R$. But the second term isn’t right. The problem is that we cannot commute $\omega(t)$ past $\omega(t')$ when $t \neq t'$. For this reason, equation (3.12) is not the solution to (3.11) when $\omega$ and $R$ are matrices. But it does give us a hint about how we should proceed. Since the problem is in the ordering of the matrices, the correct solution to (3.11) takes a similar form to (3.12), but with a different ordering. It is the path ordered exponential,

$$R(t) = P\exp\left(\int_0^t \omega(t')\,dt'\right) \qquad (3.15)$$

where the $P$ in front means that when we Taylor expand the exponential, all matrices are ordered so that later times appear on the left. In other words

$$R(t) = 1 + \int_0^t \omega(t')\,dt' + \int_0^t dt''\int_0^{t''} dt'\;\omega(t'')\,\omega(t') + \ldots \qquad (3.16)$$

The double integral is taken over the range $0 < t' < t'' < t$. If we now differentiate this double integral with respect to $t$, we get just the one term $\omega(t)\left(\int_0^t \omega(t')\,dt'\right)$, instead of the two that appear in (3.14). It can be checked that the higher terms in the Taylor expansion also have the correct property if they are ordered so that matrices evaluated at later times appear to the left in the integrals. This type of path ordered integral comes up frequently in theories involving non-commuting matrices, including the standard model of particle physics.
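
One way to get a feel for the path ordering in (3.15) is to build $R(t)$ numerically as a product of many short-time exponentials, always multiplying the newest factor on the left. The sketch below does this with plain nested lists. It is an assumption-laden illustration rather than a recommended integrator: the midpoint sampling of $\omega$, the truncated Taylor series for the matrix exponential and all function names are choices made for the example.

```python
import math

def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def identity():
    return [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]

def omega_matrix(w):
    """Antisymmetric matrix with entries omega_ab = epsilon_abc omega_c."""
    return [[0.0, w[2], -w[1]],
            [-w[2], 0.0, w[0]],
            [w[1], -w[0], 0.0]]

def mat_exp(M, terms=12):
    """Matrix exponential from the Taylor series (3.13); fine for small M."""
    result, power = identity(), identity()
    for n in range(1, terms):
        power = [[x / n for x in row] for row in mat_mul(power, M)]  # M^n / n!
        result = [[result[i][j] + power[i][j] for j in range(3)] for i in range(3)]
    return result

def path_ordered_exponential(omega_of_t, t_final, steps=1000):
    dt = t_final / steps
    R = identity()
    for n in range(steps):
        w_mid = omega_of_t((n + 0.5) * dt)
        step = mat_exp([[x * dt for x in row] for row in omega_matrix(w_mid)])
        R = mat_mul(step, R)  # later times go on the left
    return R

R = path_ordered_exponential(lambda t: (math.cos(t), math.sin(t), 0.5), 3.0)
print(R)
```

Each short-time factor is the exponential of an antisymmetric matrix and hence orthogonal, so the product stays in SO(3) up to rounding, whatever the time dependence of $\omega$. For constant $\omega$ the ordering is irrelevant and the product collapses to the ordinary exponential (3.12).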


As an aside, the rotation matrix R is a member of the Lie group SO(3), the space of 
3 x 3 orthogonal matrices with unit determinant. The antisymmetric angular velocity 
matrix w, corresponding to an instantaneous, infinitesimal rotation, lives in the Lie 
algebra so(3). 


3.2 The Inertia Tensor 


Let’s look at the kinetic energy for a rotating body. We can write

$$\begin{aligned} T &= \frac{1}{2}\sum_i m_i\,\dot{\mathbf{r}}_i^2 \\ &= \frac{1}{2}\sum_i m_i\,(\boldsymbol{\omega}\times\mathbf{r}_i)\cdot(\boldsymbol{\omega}\times\mathbf{r}_i) \\ &= \frac{1}{2}\sum_i m_i\left((\boldsymbol{\omega}\cdot\boldsymbol{\omega})(\mathbf{r}_i\cdot\mathbf{r}_i) - (\mathbf{r}_i\cdot\boldsymbol{\omega})^2\right) \end{aligned} \qquad (3.17)$$

Or, in other words, we can write the kinetic energy of a rotating body as

$$T = \frac{1}{2}\,\omega_a I_{ab}\,\omega_b \qquad (3.18)$$

where $I_{ab}$, $a,b = 1,2,3$ are the components of the inertia tensor measured in the body frame, defined by

$$I_{ab} = \sum_i m_i\left((\mathbf{r}_i\cdot\mathbf{r}_i)\,\delta_{ab} - (\mathbf{r}_i)_a(\mathbf{r}_i)_b\right) \qquad (3.19)$$

Note that $I_{ab} = I_{ba}$, so the inertia tensor is symmetric. Moreover, the components are independent of time since they are measured with respect to the body frame. For continuous bodies, we have the analogous expression

$$I = \int d^3\mathbf{r}\;\rho(\mathbf{r})\begin{pmatrix} y^2 + z^2 & -xy & -xz \\ -xy & x^2 + z^2 & -yz \\ -xz & -yz & x^2 + y^2 \end{pmatrix} \qquad (3.20)$$

Since $I_{ab}$ is a symmetric real matrix, we can diagonalise it. This means that there exists an orthogonal matrix $O$ such that $OIO^T = I'$ where $I'$ is diagonal. Equivalently, we can rotate the body frame axes $\{\mathbf{e}_a\}$ to coincide with the eigenvectors of $I$ (which are $\{O\mathbf{e}_a\}$) so that, in this frame, the inertia tensor is diagonal. These preferred body axes, in which $I$ is diagonal, are called the principal axes. In this basis,

$$I = \begin{pmatrix} I_1 & & \\ & I_2 & \\ & & I_3 \end{pmatrix} \qquad (3.21)$$

The eigenvalues $I_a$ are called the principal moments of inertia. The kinematical properties of a rigid body are fully determined by its mass, principal axes, and moments of inertia. Often the principal axes are obvious by symmetry.


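Claim: The $I_a$ are real and positive.

Proof: If $\mathbf{c}$ is an arbitrary vector, then

$$I_{ab}\,c_a c_b = \sum_i m_i\left(\mathbf{r}_i^2\,\mathbf{c}^2 - (\mathbf{r}_i\cdot\mathbf{c})^2\right) \geq 0 \qquad (3.22)$$

with equality only if all the $\mathbf{r}_i$ lie on a line. If $\mathbf{c}$ is the $a^{\rm th}$ eigenvector of $I$ then this result becomes $I_{ab}\,c_a c_b = I_a|\mathbf{c}|^2$ which tells us $I_a \geq 0$.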


Example: The Rod 


Consider the inertia tensor of a uniform rod of length $l$ and mass $M$ about its centre. The density of the rod is $\rho = M/l$. By symmetry, we have $I = \mathrm{diag}(I_1, I_1, 0)$ where

$$I_1 = \int_{-l/2}^{l/2}\rho\,z^2\,dz = \frac{1}{12}Ml^2 \qquad (3.23)$$


Example: The Disc 


Now consider a uniform disc of radius $r$ and mass $M$. We take the $z$ axis to be perpendicular to the disc and measure $I$ about its centre of mass. Again we know that $I = \mathrm{diag}(I_1, I_2, I_3)$. The density of the disc is $\rho = M/\pi r^2$, so we have

Figure 26:

$$I_1 = \int\rho\,y^2\,d^2x \ , \quad I_2 = \int\rho\,x^2\,d^2x$$

so $I_1 = I_2$ by symmetry, while

$$I_3 = \int\rho\,(x^2 + y^2)\,d^2x$$

Therefore

$$I_3 = I_1 + I_2 = 2\pi\rho\int_0^r r'^3\,dr' = \frac{1}{2}Mr^2 \qquad (3.24)$$

So the moments of inertia are $I_1 = I_2 = \frac{1}{4}Mr^2$ and $I_3 = \frac{1}{2}Mr^2$.
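
The discrete definition (3.19) is easy to evaluate directly, and doing so gives a numerical check on the rod result (3.23). The sketch below is illustrative only: the function names and the idea of approximating the rod by `n` equal point masses placed along the third axis are assumptions made for the example.

```python
def inertia_tensor(masses, positions):
    """Inertia tensor (3.19) for point masses at the given positions."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, positions):
        r2 = sum(x * x for x in r)
        for a in range(3):
            for b in range(3):
                I[a][b] += m * ((r2 if a == b else 0.0) - r[a] * r[b])
    return I

def discretised_rod(M, l, n):
    """Approximate a uniform rod along the third axis by n equal point masses."""
    dz = l / n
    masses = [M / n] * n
    positions = [(0.0, 0.0, -l / 2 + (i + 0.5) * dz) for i in range(n)]
    return masses, positions

M, l = 2.0, 3.0
I_rod = inertia_tensor(*discretised_rod(M, l, 1000))
print(I_rod[0][0], I_rod[1][1], I_rod[2][2])
print(M * l ** 2 / 12)  # continuum value from (3.23)
```

The first two diagonal entries should approach $Ml^2/12$ as `n` grows, while the third vanishes because every mass sits on the axis.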


3.2.1 Parallel Axis Theorem 


The inertia tensor depends on what point P in the body is held fixed. In general, if we 
know I about a point P it is messy to compute it about some other point P’. But it 
is very simple if P happens to coincide with the centre of mass of the object. 


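Claim: If $P'$ is displaced by $\mathbf{c}$ from the centre of mass, then

$$(I_c)_{ab} = (I_{\rm c.of.m})_{ab} + M\left(\mathbf{c}^2\,\delta_{ab} - c_a c_b\right) \qquad (3.25)$$

Proof:

$$\begin{aligned} (I_c)_{ab} &= \sum_i m_i\left\{(\mathbf{r}_i - \mathbf{c})^2\,\delta_{ab} - (\mathbf{r}_i - \mathbf{c})_a(\mathbf{r}_i - \mathbf{c})_b\right\} \\ &= \sum_i m_i\left\{\mathbf{r}_i^2\,\delta_{ab} - (\mathbf{r}_i)_a(\mathbf{r}_i)_b + \left[-2\,\mathbf{r}_i\cdot\mathbf{c}\,\delta_{ab} + (\mathbf{r}_i)_a c_b + (\mathbf{r}_i)_b c_a\right] + \left(\mathbf{c}^2\,\delta_{ab} - c_a c_b\right)\right\} \end{aligned} \qquad (3.26)$$

But the terms in square brackets that are linear in $\mathbf{r}_i$ vanish if $\mathbf{r}_i$ is measured from the centre of mass since $\sum_i m_i\mathbf{r}_i = 0$.

The term $M(\mathbf{c}^2\,\delta_{ab} - c_a c_b)$ is the inertia tensor we would find if the whole body was concentrated at the centre of mass.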
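Example: The Rod Again

The inertia tensor of the rod about one of its ends is $I_1 = \frac{1}{12}Ml^2 + M(l/2)^2 = \frac{1}{3}Ml^2$.

Example: The Disc Again

Consider measuring the inertia tensor of the disc about a point displaced by $\mathbf{c} = (c, 0, 0)$ from the centre. We have

Figure 27:

$$I_c = M\begin{pmatrix} \frac{1}{4}r^2 & & \\ & \frac{1}{4}r^2 & \\ & & \frac{1}{2}r^2 \end{pmatrix} + M\left[\begin{pmatrix} c^2 & & \\ & c^2 & \\ & & c^2 \end{pmatrix} - \begin{pmatrix} c^2 & & \\ & 0 & \\ & & 0 \end{pmatrix}\right] = M\begin{pmatrix} \frac{1}{4}r^2 & & \\ & \frac{1}{4}r^2 + c^2 & \\ & & \frac{1}{2}r^2 + c^2 \end{pmatrix}$$
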
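
The parallel axis theorem (3.25) can also be checked numerically for an arbitrary collection of point masses by computing the inertia tensor twice, once about the centre of mass and once about the displaced point. In the sketch below, the masses, positions, displacement and helper names are all made-up values chosen for illustration.

```python
def inertia_tensor(masses, positions):
    """Inertia tensor (3.19) about the origin of the given positions."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, positions):
        r2 = sum(x * x for x in r)
        for a in range(3):
            for b in range(3):
                I[a][b] += m * ((r2 if a == b else 0.0) - r[a] * r[b])
    return I

def centre_of_mass(masses, positions):
    M = sum(masses)
    return tuple(sum(m * r[k] for m, r in zip(masses, positions)) / M
                 for k in range(3))

def shifted(positions, origin):
    """Positions measured relative to a new origin."""
    return [tuple(r[k] - origin[k] for k in range(3)) for r in positions]

def parallel_axis_term(M, c):
    """The extra term M (c^2 delta_ab - c_a c_b) in (3.25)."""
    c2 = sum(x * x for x in c)
    return [[M * ((c2 if a == b else 0.0) - c[a] * c[b]) for b in range(3)]
            for a in range(3)]

masses = [1.0, 2.0, 3.0, 4.0]
positions = [(1.0, 0.0, 0.0), (0.0, 2.0, 1.0), (-1.0, 1.0, 3.0), (2.0, -1.0, 0.5)]
c = (0.3, -1.2, 0.7)

com = centre_of_mass(masses, positions)
I_com = inertia_tensor(masses, shifted(positions, com))
P_prime = tuple(com[k] + c[k] for k in range(3))
I_c = inertia_tensor(masses, shifted(positions, P_prime))
extra = parallel_axis_term(sum(masses), c)

difference = max(abs(I_c[a][b] - I_com[a][b] - extra[a][b])
                 for a in range(3) for b in range(3))
print(difference)
```

The printed difference should be at the level of floating-point rounding. Repeating the experiment with `com` replaced by some other reference point would show the linear terms of (3.26) no longer cancelling.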
3.2.2 Angular Momentum 


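The angular momentum $\mathbf{L}$ about a point $P$ can also be described neatly in terms of the inertia tensor. We have

$$\begin{aligned} \mathbf{L} &= \sum_i m_i\,\mathbf{r}_i\times\dot{\mathbf{r}}_i \\ &= \sum_i m_i\,\mathbf{r}_i\times(\boldsymbol{\omega}\times\mathbf{r}_i) \\ &= \sum_i m_i\left(\mathbf{r}_i^2\,\boldsymbol{\omega} - (\boldsymbol{\omega}\cdot\mathbf{r}_i)\,\mathbf{r}_i\right) \\ &= I\boldsymbol{\omega} \end{aligned} \qquad (3.27)$$

In the body frame, we can write $\mathbf{L} = L_a\mathbf{e}_a$ to get

$$L_a = I_{ab}\,\omega_b \qquad (3.28)$$

where $\boldsymbol{\omega} = \omega_a\mathbf{e}_a$. Note that in general, $\boldsymbol{\omega}$ is not parallel to $\mathbf{L}$: the spin of the body and its angular momentum point in different directions. This fact will lead to many of the peculiar properties of spinning objects.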


3.3 Euler’s Equations 


So far we have been discussing the rotation of a body fixed at a point P. In this section 
we will be interested in the rotation of a free body suspended in space - for example, a 
satellite or the planets. Thankfully, this problem is identical to that of an object fixed 
at a point. Let’s show why this is the case and then go on to analyse the motion. 
Figure 28:

The most general motion of a body is an overall translation superposed with a rotation. We could take this rotation to be about any point in the body (or, indeed, a point outside the body). But it is useful to consider the rotation to be about the centre of mass. We can write the position of a particle in the body as

$$\mathbf{r}_i(t) = \mathbf{R}(t) + \Delta\mathbf{r}_i(t) \qquad (3.29)$$

where $\Delta\mathbf{r}_i$ is the position measured from the centre of mass. Then examining the kinetic energy (which, for a free body, is all there is)

$$\begin{aligned} T &= \frac{1}{2}\sum_i m_i\,\dot{\mathbf{r}}_i^2 \\ &= \sum_i m_i\left[\frac{1}{2}\dot{\mathbf{R}}^2 + \dot{\mathbf{R}}\cdot(\boldsymbol{\omega}\times\Delta\mathbf{r}_i) + \frac{1}{2}(\boldsymbol{\omega}\times\Delta\mathbf{r}_i)^2\right] \\ &= \frac{1}{2}M\dot{\mathbf{R}}^2 + \frac{1}{2}\,\omega_a I_{ab}\,\omega_b \end{aligned} \qquad (3.30)$$

where we’ve used the fact that $\sum_i m_i\,\Delta\mathbf{r}_i = 0$. So we find that the dynamics separates into the motion of the centre of mass $\mathbf{R}$, together with rotation about the centre of mass. This is the reason that the analysis of the last section is valid for a free object.

3.3.1 Euler’s Equations


From now on, we shall neglect the centre of mass and concentrate on the rotation of the rigid body. Since the body is free, its angular momentum must be conserved. This gives us the vector equation

$$\frac{d\mathbf{L}}{dt} = 0 \qquad (3.31)$$

Let’s expand this in the body frame. We have

$$\begin{aligned} 0 = \frac{d\mathbf{L}}{dt} &= \frac{dL_a}{dt}\,\mathbf{e}_a + L_a\frac{d\mathbf{e}_a}{dt} \\ &= \frac{dL_a}{dt}\,\mathbf{e}_a + L_a\,\boldsymbol{\omega}\times\mathbf{e}_a \end{aligned} \qquad (3.32)$$

This simplifies if we choose the body axes $\{\mathbf{e}_a\}$ to coincide with the principal axes. Using $L_a = I_{ab}\,\omega_b$, we can then write $L_1 = I_1\omega_1$ and so on. The equations of motion (3.32) are now three non-linear coupled first order differential equations,

$$\begin{aligned} I_1\dot{\omega}_1 + \omega_2\omega_3\,(I_3 - I_2) &= 0 \\ I_2\dot{\omega}_2 + \omega_3\omega_1\,(I_1 - I_3) &= 0 \\ I_3\dot{\omega}_3 + \omega_1\omega_2\,(I_2 - I_1) &= 0 \end{aligned} \qquad (3.33)$$

These are Euler’s Equations.

We can extend this analysis to include a torque $\boldsymbol{\tau}$. The equation of motion becomes $\dot{\mathbf{L}} = \boldsymbol{\tau}$ and we can again expand in the body frame along the principal axes to derive Euler’s equations (3.33), now with the components of the torque on the RHS.
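
Euler’s equations (3.33) are straightforward to integrate numerically, and a useful sanity check on any such integration is that the kinetic energy and the magnitude of the angular momentum stay constant for a free body. The sketch below uses a classical fourth-order Runge-Kutta step; the principal moments, initial spin, step count and function names are illustrative assumptions.

```python
def euler_rhs(w, I):
    """Time derivatives of (w1, w2, w3) from the torque-free equations (3.33)."""
    I1, I2, I3 = I
    return (-w[1] * w[2] * (I3 - I2) / I1,
            -w[2] * w[0] * (I1 - I3) / I2,
            -w[0] * w[1] * (I2 - I1) / I3)

def rk4_step(w, I, dt):
    def add(u, k, h):
        return tuple(u[i] + h * k[i] for i in range(3))
    k1 = euler_rhs(w, I)
    k2 = euler_rhs(add(w, k1, 0.5 * dt), I)
    k3 = euler_rhs(add(w, k2, 0.5 * dt), I)
    k4 = euler_rhs(add(w, k3, dt), I)
    return tuple(w[i] + dt * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6
                 for i in range(3))

def integrate(w, I, t_final, steps):
    dt = t_final / steps
    for _ in range(steps):
        w = rk4_step(w, I, dt)
    return w

def twice_kinetic_energy(w, I):
    return sum(I[a] * w[a] ** 2 for a in range(3))

def angular_momentum_squared(w, I):
    return sum((I[a] * w[a]) ** 2 for a in range(3))

I = (1.0, 2.0, 3.0)    # assumed principal moments
w0 = (1.0, 0.1, 0.5)   # assumed initial spin in the body frame
w1 = integrate(w0, I, 10.0, 5000)
print(twice_kinetic_energy(w0, I), twice_kinetic_energy(w1, I))
print(angular_momentum_squared(w0, I), angular_momentum_squared(w1, I))
```

The components of $\boldsymbol{\omega}$ themselves change substantially over the run, while the two printed pairs should agree to within the integration error.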


3.4 Free Tops 


“To those who study the progress of exact science, the common spinning-top 
is a symbol of the labours and the perplexities of men.” 
James Clerk Maxwell, no less 


In this section, we’ll analyse the motion of free rotating bodies (known as free tops) using Euler’s equation.

We start with a trivial example: the sphere. For this object, $I_1 = I_2 = I_3$ which means that the angular velocity $\boldsymbol{\omega}$ is parallel to the angular momentum $\mathbf{L}$. Indeed, Euler’s equations tell us that $\omega_a$ is a constant in this case and the sphere continues to spin around the same axis you start it on. To find a more interesting case, we need to look at the next simplest object.


3.4.1 The Symmetric Top 


Figure 29:

The symmetric top is an object with $I_1 = I_2 \neq I_3$. A typical example is drawn in figure 29. Euler’s equations become

$$\begin{aligned} I_1\dot{\omega}_1 &= \omega_2\omega_3\,(I_1 - I_3) \\ I_2\dot{\omega}_2 &= -\omega_1\omega_3\,(I_1 - I_3) \\ I_3\dot{\omega}_3 &= 0 \end{aligned} \qquad (3.34)$$

Figure 30: The precession of the spin: the direction of precession depends on whether the object is short and fat ($I_3 > I_1$) or tall and skinny ($I_3 < I_1$)

So, in this case, we see that $\omega_3$, which is the spin about the symmetric axis, is a constant of motion. In contrast, the spins about the other two axes are time dependent and satisfy

$$\dot{\omega}_1 = \Omega\,\omega_2 \ , \quad \dot{\omega}_2 = -\Omega\,\omega_1 \qquad (3.35)$$

where

$$\Omega = \omega_3\,(I_1 - I_3)/I_1 \qquad (3.36)$$

is a constant. These equations are solved by

$$(\omega_1, \omega_2) = \omega_0\,(\sin\Omega t, \cos\Omega t) \qquad (3.37)$$

for any constant $\omega_0$. This means that, in the body frame, the direction of the spin is not constant: it precesses about the $\mathbf{e}_3$ axis with frequency $\Omega$. The direction of the precession depends on the sign of $\Omega$ or, in other words, whether $I_1 > I_3$ or $I_1 < I_3$. This is drawn in figure 30.

What does this look like in an inertial frame? The angular momentum $\mathbf{L}$ is a fixed vector. But $\omega_3$, and hence $L_3$, are also fixed which ensures that the angle between $\mathbf{e}_3$ and $\mathbf{L}$ doesn’t change in time. Instead, $\mathbf{e}_3$ precesses around $\mathbf{L}$, while the body simultaneously spins such that $\boldsymbol{\omega}$ remains between $\mathbf{e}_3$ and $\mathbf{L}$. (If this is hard to visualise, then try searching YouTube for “free symmetric tops” or something similar and you will find videos like this.) The fact that $\boldsymbol{\omega}$ precesses around the body frame axis $\mathbf{e}_3$ is sometimes referred to as a wobble.

Figure 31:
3.4.2 Example: The Earth’s Wobble

The spin of the earth causes it to bulge at the equator so it is no longer a sphere but can be treated as a symmetric top. It is an oblate ellipsoid, with $I_3 > I_1$, and is spherical to roughly 1 part in 300, meaning

$$\frac{I_1 - I_3}{I_1} \approx -\frac{1}{300} \qquad (3.38)$$

Of course, we know the magnitude of the spin $\omega_3$: it is $\omega_3 = (1\ \text{day})^{-1}$. This information is enough to calculate the frequency of the earth’s wobble; from (3.36), it should be

$$\Omega_{\rm earth} = -\frac{1}{300}\ \text{day}^{-1} \qquad (3.39)$$

Figure 32: (labels: North Pole, Equator)

This calculation was first performed by Euler in 1749 who predicted that the Earth completes a wobble every 300 days. Despite many searches, this effect wasn’t detected until 1891 when Chandler re-analysed the data and saw a wobble with a period of 427 days. It is now known as the Chandler wobble. It is very small! The angular velocity $\boldsymbol{\omega}$ intercepts the surface of the earth approximately 10 metres from the North pole and precesses around it. More recent measurements place the period at 435 days, with the discrepancy between the predicted 300 days and observed 435 days due to the fact that the Earth is not a rigid body, but is flexible because of tidal effects. Less well understood is why these same tidal effects haven’t caused the wobble to dampen and disappear completely. There are various theories about what keeps the wobble alive, from earthquakes to fluctuating pressure at the bottom of the ocean.
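
As a quick arithmetic check of the 300-day prediction, the step from (3.36) and (3.38) to a period can be written out as follows. The symbol $T_{\rm wobble}$ is introduced here only for illustration, and the rotation period $2\pi/\omega_3$ is taken to be one day.

```latex
|\Omega_{\rm earth}| = \omega_3\,\frac{I_3 - I_1}{I_1} \approx \frac{\omega_3}{300}
\quad\Longrightarrow\quad
T_{\rm wobble} = \frac{2\pi}{|\Omega_{\rm earth}|}
\approx 300 \times \frac{2\pi}{\omega_3} = 300\ \text{days}
```

The observed period of roughly 435 days is longer than this rigid-body estimate, in line with the discussion above.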


3.4.3 The Asymmetric Top: Stability 


The most general body has no symmetries and , # I 4 I # h. The rotational 
motion is more complicated but there is a simple result that we will describe here. 
Consider the case where the spin is completely about one of the principal axes, say e1. 
Le 


Wy = Q 5 W2 = W3 = 0 (3.40) 


This solves Euler’s equations (3.33). The question we want to ask is: what happens if
the spin varies slightly from this direction? To answer this, consider small perturbations
about the spin

$$\omega_1 = \Omega + \eta_1\,, \quad \omega_2 = \eta_2\,, \quad \omega_3 = \eta_3 \qquad (3.41)$$

where $\eta_a$, $a = 1,2,3$ are all taken to be small. Substituting this into Euler’s equations
and ignoring terms of order $\eta^2$ and higher, we have

$$I_1\dot\eta_1 = 0$$
$$I_2\dot\eta_2 = \Omega\,\eta_3\,(I_3 - I_1) \qquad (3.42)$$
$$I_3\dot\eta_3 = \Omega\,\eta_2\,(I_1 - I_2) \qquad (3.43)$$


Differentiating the second equation and substituting the third gives an equation for just
one of the perturbations, say $\eta_2$,

$$I_2\ddot\eta_2 = \frac{\Omega^2}{I_3}(I_3 - I_1)(I_1 - I_2)\,\eta_2 \equiv A\,\eta_2 \qquad (3.44)$$

The fate of the small perturbation depends on the sign of the quantity $A$. We have two
possibilities

• $A < 0$: In this case, the disturbance will oscillate around the constant motion.
• $A > 0$: In this case, the disturbance will grow exponentially.

Examining the definition of $A$, we find that the motion is unstable if

$$I_2 < I_1 < I_3 \quad\text{or}\quad I_3 < I_1 < I_2 \qquad (3.45)$$


with all other motions stable. In other words, a body will rotate stably about the axis 
with the largest or the smallest moment of inertia, but not about the intermediate axis. 
Pick up a tennis racket and try it for yourself! 
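
If no tennis racket is to hand, the same experiment can be run numerically. The sketch below integrates Euler's equations for a free body with a hand-written fourth-order Runge-Kutta step and reports how far the angular velocity strays from the axis it started near. The moments of inertia (1, 2, 3), the size of the initial kick and the step size are all illustrative assumptions, not values from the text.

```python
# Integrate Euler's equations for a free asymmetric top and compare the three principal axes.
# All numerical values below are illustrative assumptions.

def euler_rhs(w, I):
    """Time derivative of the body-frame angular velocity of a free rigid body."""
    w1, w2, w3 = w
    I1, I2, I3 = I
    return (
        (I2 - I3) * w2 * w3 / I1,
        (I3 - I1) * w3 * w1 / I2,
        (I1 - I2) * w1 * w2 / I3,
    )

def rk4_step(w, I, dt):
    """One classical Runge-Kutta step for Euler's equations."""
    def shifted(base, slope, scale):
        return tuple(x + scale * k for x, k in zip(base, slope))
    k1 = euler_rhs(w, I)
    k2 = euler_rhs(shifted(w, k1, dt / 2), I)
    k3 = euler_rhs(shifted(w, k2, dt / 2), I)
    k4 = euler_rhs(shifted(w, k3, dt), I)
    return tuple(
        x + dt * (a + 2 * b + 2 * c + d) / 6
        for x, a, b, c, d in zip(w, k1, k2, k3, k4)
    )

def max_off_axis_spin(axis, I, kick=1e-3, t_end=50.0, dt=5e-3):
    """Spin about `axis` (0, 1 or 2) with a small kick; return the largest off-axis |w| seen."""
    w = [kick, kick, kick]
    w[axis] = 1.0
    w = tuple(w)
    worst = 0.0
    for _ in range(int(t_end / dt)):
        w = rk4_step(w, I, dt)
        off = sum(w[k] ** 2 for k in range(3) if k != axis) ** 0.5
        worst = max(worst, off)
    return worst

I = (1.0, 2.0, 3.0)  # I1 < I2 < I3, so index 1 is the intermediate axis
results = [max_off_axis_spin(axis, I) for axis in range(3)]
for axis, worst in enumerate(results):
    print(f"spin about e{axis + 1}: largest off-axis angular velocity = {worst:.4f}")
```

For the smallest and largest axes the off-axis part stays comparable to the initial kick, while for the intermediate axis it grows to order one, in line with (3.45).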


3.4.4 The Asymmetric Top: Poinsot Construction 


The analytic solution for the general motion of an asymmetric top is rather complicated, 
involving Jacobian elliptic functions. But there’s a nice geometrical way of viewing the 
motion due to Poinsot. 


We start by working in the body frame. There are two constants of motion: the
kinetic energy $T$ and the magnitude of the angular momentum $L^2$. In terms of the
angular velocity, they are

$$2T = I_1\omega_1^2 + I_2\omega_2^2 + I_3\omega_3^2 \qquad (3.46)$$
$$L^2 = I_1^2\omega_1^2 + I_2^2\omega_2^2 + I_3^2\omega_3^2 \qquad (3.47)$$

Each of these equations defines an ellipsoid in $\omega$ space. The motion of the vector $\boldsymbol\omega$ is
constrained to lie on the intersection of these two ellipsoids. The first of these ellipsoids,
defined by

$$\frac{I_1}{2T}\,\omega_1^2 + \frac{I_2}{2T}\,\omega_2^2 + \frac{I_3}{2T}\,\omega_3^2 = 1 \qquad (3.48)$$

is known as the inertia ellipsoid (or, sometimes, the inertia quadric). If we fix the
kinetic energy, we can think of this abstract ellipsoid as embedded within the object,
rotating with it.


The inertia ellipsoid is drawn in figure 33, where we’ve chosen $I_1 > I_2 > I_3$ so that
the major axis is $\omega_3$ and the minor axis is $\omega_1$. The lines drawn on the figure are the
intersection of the inertia ellipsoid with the other ellipsoid, defined by (3.47), for
various values of $L^2$. Since this has the same major and minor axes as the inertia
ellipsoid (because $I_1^2 > I_2^2 > I_3^2$), the intersection lines are small circles around the
$\omega_1$ and $\omega_3$ axes, but two lines passing through the $\omega_2$ axis. For fixed $T$ and $L^2$, the
vector $\boldsymbol\omega$ moves along one of the intersection lines. This provides a pictorial
demonstration of the fact we learnt in the previous subsection: an object will spin in a
stable manner around the principal axes with the smallest and largest moments of inertia,
but not around the intermediate axis. The path that $\boldsymbol\omega$ traces on the inertia ellipsoid
is known as the polhode curve. We see from the figure that the polhode curves are always
closed, and motion in the body frame is periodic.

Figure 33:


So much for the body frame. What does all this look like in the space frame? The
vector $\mathbf L$ is a constant of motion. Since the kinetic energy $2T = \mathbf L\cdot\boldsymbol\omega$ is also constant,
we learn that $\boldsymbol\omega$ must lie in a fixed plane perpendicular to $\mathbf L$. This is known as the
invariable plane. The inertia ellipsoid touches the invariable plane at the point defined
by the angular velocity vector $\boldsymbol\omega$. Moreover, the invariable plane is always tangent to
the inertia ellipsoid at the point $\boldsymbol\omega$. To see this, note that the angular momentum can
be written as

$$\mathbf L = \nabla_{\omega} T \qquad (3.49)$$

where the gradient operator is in $\omega$ space, i.e. $\nabla_\omega = (\partial/\partial\omega_1, \partial/\partial\omega_2, \partial/\partial\omega_3)$. But
recall that the inertia ellipsoid is defined as a level surface of $T$, so equation (3.49) tells
us that the angular momentum $\mathbf L$ is always perpendicular to the ellipsoid. This, in
turn, ensures that the invariable plane is always tangent to the ellipsoid. In summary,
the angular velocity traces out two curves: one on the inertia ellipsoid, known as the
polhode curve, and another on the invariable plane, known as the herpolhode curve.
The body moves as if it is embedded within the inertia ellipsoid, which rolls around the
invariable plane without slipping, with the center of the ellipsoid a constant distance
from the plane. The motion is shown in figure 34. Unlike the polhode curve, the
herpolhode curve does not necessarily close.

Figure 34: The inertia ellipsoid rolling around on the invariable plane, with the polhode
and herpolhode curves drawn for a fixed time period.


An Example: The Asteroid Toutatis 


Astronomical objects are usually symmetric, but there’s an important exception wandering
around our solar system, depicted in figure¹ 35. This is the asteroid Toutatis. In
September 2004 it passed the earth at a distance of about four times that to the moon.
This is (hopefully!) the closest any asteroid will come for the next 60 years. The orbit
of Toutatis is thought to be chaotic, which could potentially be bad news for Earth
a few centuries from now. As you can see from the picture, its tumbling motion is
complicated. It is aperiodic. The pictures show the asteroid at intervals of a day. The
angular momentum vector $\mathbf L$ remains fixed and vertical throughout the motion. The
angular velocity $\boldsymbol\omega$ traces out the herpolhode curve in the horizontal plane, perpendicular
to $\mathbf L$. The angular velocity vector $\boldsymbol\omega$ also traces out a curve over the asteroid’s
surface: this is the polhode curve. It has a period of 5.4 days which you can observe
by noting that $\boldsymbol\omega$ has roughly the same orientation relative to the principal axes every
five to six days.


However, there are further effects at play in a real object like Toutatis which is not
spinning around a principal axis. Various stresses and strains lead to dissipation. This
means that the angular velocity $\boldsymbol\omega$ does not quite follow the polhode curve in Figure
33. Instead it begins close to the major axis $\omega_3$ and slowly spirals towards the minor
axis $\omega_1$. This is why we see so few wobbling asteroids.


Figure 35: By Toutatis! The three principal axes are shown in red, green and blue (without
arrows). The angular momentum $\mathbf L$ is the vertical, purple arrow. The angular velocity $\boldsymbol\omega$ is
the circled, yellow arrow.


¹ This picture was created by Scott Hudson of Washington State University and was taken from
http://www.solarviews.com/eng/toutatis.htm where you can find many interesting facts about the
asteroid.


3.5 Euler’s Angles


So far we’ve managed to make quite a lot of progress working just with the angular
velocity $\omega_a$ and we haven’t needed to introduce an explicit parametrization of the
configuration space $C$. But to make further progress we’re going to need to do this.
We will use a choice due to Euler which often leads to simple solutions.


Figure 36: The rotation from space frame $\{\tilde{\mathbf e}_a\}$ to body frame $\{\mathbf e_a\}$.


A general rotation of a set of axes is shown in Figure 36. We’d like to construct a
way of parameterizing such a rotation. The way to do this was first described by Euler:


Euler’s Theorem:


An arbitrary rotation may be expressed as the product of 3 successive rotations about
3 (in general) different axes.


Proof: Let $\{\tilde{\mathbf e}_a\}$ be space frame axes. Let $\{\mathbf e_a\}$ be body frame axes. We want to find
the rotation $R$ so that $\mathbf e_a = R_{ab}\tilde{\mathbf e}_b$. We can accomplish this in three steps

$$\{\tilde{\mathbf e}_a\} \xrightarrow{\;R_3(\phi)\;} \{\mathbf e'_a\} \xrightarrow{\;R_1(\theta)\;} \{\mathbf e''_a\} \xrightarrow{\;R_3(\psi)\;} \{\mathbf e_a\} \qquad (3.50)$$

Let’s look at these steps in turn.


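Step 1: Rotate by $\phi$ about the $\tilde{\mathbf e}_3$ axis. So $\mathbf e'_a = R_3(\phi)_{ab}\,\tilde{\mathbf e}_b$ with

$$R_3(\phi) = \begin{pmatrix} \cos\phi & \sin\phi & 0 \\ -\sin\phi & \cos\phi & 0 \\ 0 & 0 & 1 \end{pmatrix} \qquad (3.51)$$

This is shown in Figure 37.


Figure 37: Step 1: Rotate around the space-frame axis $\tilde{\mathbf e}_3$.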


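Step 2: Rotate by $\theta$ about the new axis $\mathbf e'_1$. This axis $\mathbf e'_1$ is sometimes called the
“line of nodes”. We write $\mathbf e''_a = R_1(\theta)_{ab}\,\mathbf e'_b$ with

$$R_1(\theta) = \begin{pmatrix} 1 & 0 & 0 \\ 0 & \cos\theta & \sin\theta \\ 0 & -\sin\theta & \cos\theta \end{pmatrix} \qquad (3.52)$$

This is shown in Figure 38.


Figure 38: Step 2: Rotate around the new axis $\mathbf e'_1$.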


Step 3: Rotate by $\psi$ about the new new axis $\mathbf e''_3$ so $\mathbf e_a = R_3(\psi)_{ab}\,\mathbf e''_b$ with

$$R_3(\psi) = \begin{pmatrix} \cos\psi & \sin\psi & 0 \\ -\sin\psi & \cos\psi & 0 \\ 0 & 0 & 1 \end{pmatrix} \qquad (3.53)$$

This is shown in Figure 39.


Figure 39: Step 3: Rotate around the latest axis $\mathbf e''_3$.


So putting it all together, we have

$$R_{ab}(\phi,\theta,\psi) = \left[R_3(\psi)\,R_1(\theta)\,R_3(\phi)\right]_{ab} \qquad (3.54)$$

The angles $\phi$, $\theta$, $\psi$ are the Euler angles. If we write out the matrix $R(\phi,\theta,\psi)$ longhand,
it reads

$$R = \begin{pmatrix}
\cos\psi\cos\phi - \cos\theta\sin\phi\sin\psi & \sin\phi\cos\psi + \cos\theta\sin\psi\cos\phi & \sin\theta\sin\psi \\
-\cos\phi\sin\psi - \cos\theta\cos\psi\sin\phi & -\sin\psi\sin\phi + \cos\theta\cos\psi\cos\phi & \sin\theta\cos\psi \\
\sin\theta\sin\phi & -\sin\theta\cos\phi & \cos\theta
\end{pmatrix}$$

Note: Recall that we may expand a vector $\mathbf r$ either in the body frame $\mathbf r = r_a\mathbf e_a$, or in
the space frame $\mathbf r = \tilde r_a\tilde{\mathbf e}_a$. The above rotations can be equally well expressed in terms
of the coordinates $r_a$ rather than the basis $\{\mathbf e_a\}$: we have $\tilde r_b = r_a R_{ab}$. Be aware that
some books choose to describe the Euler angles in terms of the coordinates $r_a$ which
they write in vector form. In some conventions this can lead to an apparent reversal in
the ordering of the three rotation matrices.
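
The longhand matrix is easy to mistype, so it is worth checking it against the product (3.54). The following sketch multiplies the three elementary rotations of (3.51) to (3.53) using plain nested lists and compares the result, entry by entry, with the longhand form. The helper names (`matmul`, `euler_matrix`, `euler_matrix_longhand`) and the test angles are illustrative choices.

```python
import math

def matmul(A, B):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def R3(angle):
    """Rotation about the third axis, as in (3.51) and (3.53)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def R1(angle):
    """Rotation about the first axis, as in (3.52)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, s], [0.0, -s, c]]

def euler_matrix(phi, theta, psi):
    """R = R3(psi) R1(theta) R3(phi), equation (3.54)."""
    return matmul(R3(psi), matmul(R1(theta), R3(phi)))

def euler_matrix_longhand(phi, theta, psi):
    """The same matrix written out entry by entry."""
    cf, sf = math.cos(phi), math.sin(phi)
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(psi), math.sin(psi)
    return [
        [cp * cf - ct * sf * sp, sf * cp + ct * sp * cf, st * sp],
        [-cf * sp - ct * cp * sf, -sp * sf + ct * cp * cf, st * cp],
        [st * sf, -st * cf, ct],
    ]

phi, theta, psi = 0.3, 1.1, -0.7  # arbitrary test angles in radians
R = euler_matrix(phi, theta, psi)
R_long = euler_matrix_longhand(phi, theta, psi)
max_diff = max(abs(R[i][j] - R_long[i][j]) for i in range(3) for j in range(3))

# Orthogonality check: R R^T should be the identity.
Rt = [[R[j][i] for j in range(3)] for i in range(3)]
RRt = matmul(R, Rt)
orth_err = max(abs(RRt[i][j] - (1.0 if i == j else 0.0)) for i in range(3) for j in range(3))

print(f"max difference from longhand form: {max_diff:.2e}")
print(f"max deviation of R R^T from identity: {orth_err:.2e}")
```

Swapping the order of the three factors in `euler_matrix` is a quick way to see the apparent reversal of ordering mentioned in the note.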


3.5.1 Leonhard Euler (1707-1783) 


As is clear from the section headings, the main man for this chapter is Euler, by far the
most prolific mathematician of all time. As well as developing the dynamics of rotations,
he made huge contributions to the fields of number theory, geometry, topology, analysis
and fluid dynamics. For example, the lovely equation $e^{i\theta} = \cos\theta + i\sin\theta$ is due to
Euler. In 1744 he was the first to correctly present a limited example of the calculus of
variations (which we saw in section 2.1) although he generously gives credit to a rather
botched attempt by his friend Maupertuis in the same year. Euler also invented much
of the modern notation of mathematics: $f(x)$ for a function; $e$ for exponential; $\pi$ for,
well, $\pi$ and so on.


Euler was born in Basel, Switzerland and held positions in St Petersburg, Berlin and,
after falling out with Frederick the Great, St Petersburg again. He was pretty much
absorbed with mathematics day and night. Upon losing the sight in his right eye in
his twenties he responded with: “Now I will have less distraction”. Even when he went
completely blind later in life, it didn’t slow him down much as he went on to produce
over half of his total work. The St Petersburg Academy of Science continued to publish
his work for a full 50 years after his death.


3.5.2 Angular Velocity 


There’s a simple expression for the instantaneous angular velocity $\boldsymbol\omega$ in terms of Euler
angles. To derive this, we could simply plug (3.54) into the definition of angular velocity
(3.5). But this is tedious, and a little bit of thought about what this means physically
will get us there quicker. Consider the motion of a rigid body in an infinitesimal time
$dt$ during which

$$(\psi, \theta, \phi) \to (\psi + d\psi,\ \theta + d\theta,\ \phi + d\phi) \qquad (3.55)$$

From the definition of the Euler angles, the angular velocity must be of the form

$$\boldsymbol\omega = \dot\phi\,\tilde{\mathbf e}_3 + \dot\theta\,\mathbf e'_1 + \dot\psi\,\mathbf e_3 \qquad (3.56)$$

But we can express the first two vectors in terms of the body frame. They are

$$\tilde{\mathbf e}_3 = \sin\theta\sin\psi\,\mathbf e_1 + \sin\theta\cos\psi\,\mathbf e_2 + \cos\theta\,\mathbf e_3$$
$$\mathbf e'_1 = \cos\psi\,\mathbf e_1 - \sin\psi\,\mathbf e_2 \qquad (3.57)$$

from which we can express $\boldsymbol\omega$ in terms of the Euler angles in the body frame axes

$$\boldsymbol\omega = [\dot\phi\sin\theta\sin\psi + \dot\theta\cos\psi]\,\mathbf e_1 + [\dot\phi\sin\theta\cos\psi - \dot\theta\sin\psi]\,\mathbf e_2 + [\dot\psi + \dot\phi\cos\theta]\,\mathbf e_3 \qquad (3.58)$$

By playing a similar game, we can also express $\boldsymbol\omega$ in the space frame axes.
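
The "tedious" route can be delegated to a computer as a cross-check on (3.58). The sketch below differentiates the rotation matrix (3.54) numerically, forms the antisymmetric matrix $\dot R R^T$ and reads off the body-frame components. It assumes the convention that the (2,3), (3,1) and (1,2) entries of $\dot R R^T$ are $\omega_1$, $\omega_2$, $\omega_3$ respectively; this assumption, the helper names and the sample angles and rates are not taken from the text above.

```python
import math

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def R3(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def R1(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, s], [0.0, -s, c]]

def euler_matrix(phi, theta, psi):
    """R = R3(psi) R1(theta) R3(phi), equation (3.54)."""
    return matmul(R3(psi), matmul(R1(theta), R3(phi)))

def omega_body(angles, rates):
    """Body-frame components of the angular velocity from (3.58)."""
    phi, theta, psi = angles
    dphi, dtheta, dpsi = rates
    return (
        dphi * math.sin(theta) * math.sin(psi) + dtheta * math.cos(psi),
        dphi * math.sin(theta) * math.cos(psi) - dtheta * math.sin(psi),
        dpsi + dphi * math.cos(theta),
    )

def omega_finite_difference(angles, rates, h=1e-6):
    """Estimate omega from the antisymmetric matrix (dR/dt) R^T by central differences."""
    ahead = euler_matrix(*[a + h * r for a, r in zip(angles, rates)])
    behind = euler_matrix(*[a - h * r for a, r in zip(angles, rates)])
    Rdot = [[(ahead[i][j] - behind[i][j]) / (2 * h) for j in range(3)] for i in range(3)]
    W = matmul(Rdot, transpose(euler_matrix(*angles)))
    # Assumed convention: W_23 = omega_1, W_31 = omega_2, W_12 = omega_3.
    return (W[1][2], W[2][0], W[0][1])

angles = (0.4, 0.9, 1.7)   # (phi, theta, psi), arbitrary
rates = (0.5, -0.3, 2.0)   # (phi_dot, theta_dot, psi_dot), arbitrary
exact = omega_body(angles, rates)
approx = omega_finite_difference(angles, rates)
max_err = max(abs(x - y) for x, y in zip(exact, approx))
print(f"largest disagreement between (3.58) and the finite-difference estimate: {max_err:.2e}")
```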


3.5.3 The Free Symmetric Top Revisited 


In section 3.4 we studied the free symmetric top working in the body frame and found
a constant spin $\omega_3$ while, as shown in equation (3.37), $\omega_1$ and $\omega_2$ precess as

$$(\omega_1, \omega_2) = \omega_0(\sin\Omega t, \cos\Omega t) \quad\text{with}\quad \Omega = \omega_3\,\frac{(I_1 - I_3)}{I_1} \qquad (3.59)$$

But what does this look like in the space frame? Now that we have parametrised
motion in the space frame in terms of Euler angles, we can answer this question. This
is simplest if we choose the angular momentum $\mathbf L$ to lie along the $\tilde{\mathbf e}_3$ space-axis. Then
because both $\mathbf L$ and its body-frame component $L_3 = I_3\omega_3$ are conserved, so too is the
angle between them. But this angle is precisely $\theta$. We learn that $\dot\theta = 0$.


Figure 40: Euler angles for the free symmetric top when $\mathbf L$ coincides with $\tilde{\mathbf e}_3$.


Next, we use the equations (3.58) to solve for $\phi(t)$ and $\psi(t)$. Setting $\dot\theta = 0$ and
comparing the first two components with (3.59), the solution is given by $\dot\psi = \Omega$. (This
relation also follows by staring at the figure and thinking about how the object spins as
it precesses around $\mathbf L$.) But we know from (3.58) that the expression for $\omega_3$ (which,
remember, is the component of $\boldsymbol\omega$ in the body frame) in terms of Euler angles is
$\omega_3 = \dot\psi + \dot\phi\cos\theta$ so, substituting for $\Omega = \dot\psi$, we find the precession frequency

$$\dot\phi = \frac{I_3\,\omega_3}{I_1\cos\theta} \qquad (3.60)$$

Staring at the figure, the relationship $\dot\psi = \Omega$ may not be immediately clear.


An Example: The Wobbling Plate 
The physicist Richard Feynman tells the following story: 


“I was in the cafeteria and some guy, fooling around, throws a plate in the
air. As the plate went up in the air I saw it wobble, and I noticed the red 
medallion of Cornell on the plate going around. It was pretty obvious to me 
that the medallion went around faster than the wobbling. 


I had nothing to do, so I start figuring out the motion of the rotating plate. 
I discover that when the angle is very slight, the medallion rotates twice as 
fast as the wobble rate — two to one. It came out of a complicated equation! 


I went on to work out equations for wobbles. Then I thought about how the 
electron orbits start to move in relativity. Then there’s the Dirac equation 
in electrodynamics. And then quantum electrodynamics. And before I knew
it....the whole business that I got the Nobel prize for came from that piddling 
around with the wobbling plate.” 


Feynman was right about quantum electrodynamics. But what about the plate? We can look
at this easily using what we’ve learnt. The spin of the plate is $\omega_3$, while the
precession, or wobble, rate is $\dot\phi$, which is given in (3.60). To calculate this, we need
the moments of inertia for a plate. But we figured this out for the disc in Section 3.2
where we found that $I_3 = 2I_1$. We can use this to see that $\dot\psi = -\omega_3$ for this example
and so, for slight angles $\theta$, we have

$$\dot\phi \approx 2\omega_3 \qquad (3.61)$$

Figure 41:

Or, in other words, the wobble rate is twice as fast as the spin of the plate. It’s the
opposite to how Feynman remembers!


There is another elegant and simple method you can use to see that Feynman was 
wrong: you can pick up a plate and throw it. It’s hard to see that the wobble to spin 
ratio is exactly two. But it’s easy to see that it wobbles faster than it spins. 
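
For readers without a spare plate, the ratio can also be read off from (3.60). This small sketch evaluates the wobble-to-spin ratio for a flat disc with $I_3 = 2I_1$ at a few tilt angles; the angles chosen and the function name `precession_rate` are illustrative.

```python
import math

def precession_rate(spin, I1, I3, theta):
    """phi_dot = I3 * w3 / (I1 * cos(theta)), equation (3.60)."""
    return I3 * spin / (I1 * math.cos(theta))

I1 = 1.0
I3 = 2.0 * I1  # flat disc, as found in Section 3.2

# Wobble rate per unit spin for a few tilt angles (in degrees).
ratios = {deg: precession_rate(1.0, I1, I3, math.radians(deg)) for deg in (1, 5, 10, 20)}
for deg, ratio in ratios.items():
    print(f"tilt {deg:2d} deg: wobble / spin = {ratio:.4f}")
```

The ratio tends to two as the tilt goes to zero and grows slowly as the plate tips further over.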


3.6 The Heavy Symmetric Top 


The “heavy” in the title of this section means that the top is acted upon by gravity.
We’ll deal only with a symmetric top, pinned at a point $P$ which is a distance $l$ from
the centre of mass. This system is drawn in figure 42. The principal axes are $\mathbf e_1$, $\mathbf e_2$
and $\mathbf e_3$ and we have $I_1 = I_2$. From what we have learnt so far, it is easy to write down
the Lagrangian:

$$L = \tfrac12 I_1(\omega_1^2 + \omega_2^2) + \tfrac12 I_3\omega_3^2 - Mgl\cos\theta$$
$$\phantom{L} = \tfrac12 I_1(\dot\theta^2 + \sin^2\theta\,\dot\phi^2) + \tfrac12 I_3(\dot\psi + \cos\theta\,\dot\phi)^2 - Mgl\cos\theta \qquad (3.62)$$

A quick examination of this equation tells us that both $\psi$ and $\phi$ are ignorable
coordinates. This gives us the constants of motion $p_\psi$ and $p_\phi$, where

$$p_\psi = I_3(\dot\psi + \cos\theta\,\dot\phi) = I_3\omega_3 \qquad (3.63)$$

This is the angular momentum about the symmetry axis $\mathbf e_3$ of the top. The angular
velocity $\omega_3$ about this axis is simply called the spin of the top and, as for the free
symmetric top, it is a constant. The other constant of motion is

$$p_\phi = I_1\sin^2\theta\,\dot\phi + I_3\cos\theta\,(\dot\psi + \dot\phi\cos\theta) \qquad (3.64)$$


Figure 42: The heavy top with its Euler angles 


As well as these two conjugate momenta, the total energy $E$ is also conserved

$$E = T + V = \tfrac12 I_1(\dot\theta^2 + \dot\phi^2\sin^2\theta) + \tfrac12 I_3\omega_3^2 + Mgl\cos\theta \qquad (3.65)$$

To simplify these equations, let’s define the two constants

$$a = \frac{I_3\omega_3}{I_1} \quad\text{and}\quad b = \frac{p_\phi}{I_1} \qquad (3.66)$$

Then we can write

$$\dot\phi = \frac{b - a\cos\theta}{\sin^2\theta} \qquad (3.67)$$

and

$$\dot\psi = \frac{I_1 a}{I_3} - \frac{(b - a\cos\theta)\cos\theta}{\sin^2\theta} \qquad (3.68)$$

So if we can solve $\theta = \theta(t)$ somehow, then we can always integrate these two equations
to get $\phi(t)$ and $\psi(t)$. But first we have to figure out what $\theta$ is doing. To do this, let’s
define the “reduced energy” $E' = E - \tfrac12 I_3\omega_3^2$. Then, since $E$ and $\omega_3$ are constant, so is
$E'$. We have

$$E' = \tfrac12 I_1\dot\theta^2 + V_{\text{eff}}(\theta) \qquad (3.69)$$

where the effective potential is

$$V_{\text{eff}}(\theta) = \frac{I_1(b - a\cos\theta)^2}{2\sin^2\theta} + Mgl\cos\theta \qquad (3.70)$$


Figure 43:


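So we’ve succeeded in getting an equation (3.69) purely in terms of $\theta$. To simplify the
analysis, let’s define the new coordinate

$$u = \cos\theta \qquad (3.71)$$

Clearly $-1 \le u \le 1$. We’ll also define two further constants to help put the equations
in the most concise form

$$\alpha = \frac{2E'}{I_1} \quad\text{and}\quad \beta = \frac{2Mgl}{I_1} \qquad (3.72)$$

With all these redefinitions, the equations of motion (3.67), (3.68) and (3.69) can be
written as

$$\dot u^2 = (1 - u^2)(\alpha - \beta u) - (b - au)^2 \equiv f(u) \qquad (3.73)$$
$$\dot\phi = \frac{b - au}{1 - u^2} \qquad (3.74)$$
$$\dot\psi = \frac{I_1 a}{I_3} - \frac{u(b - au)}{1 - u^2} \qquad (3.75)$$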
. [ha  u(b—au) 
pats (3.75) 


We could take the square root of equation (3.73) and integrate to reduce the problem 
to quadrature. The resulting integral is known as an “elliptic integral”. But, rather 
than doing this, there’s a better way to understand the physics qualitatively. 


Note that the function $f(u)$ defined in (3.73) is a cubic polynomial that behaves as

$$f(u) \to \begin{cases} +\infty & \text{as } u \to \infty \\ -\infty & \text{as } u \to -\infty \end{cases} \qquad (3.76)$$

and $f(\pm 1) = -(b \mp a)^2 \le 0$. So if we plot the function $f(u)$, it looks like figure 43.


Figure 44: The three different types of motion depend on the direction of precession at the
extremal points. (Panels: Motion 1, Motion 2, Motion 3.)


The physical range is $\dot u^2 = f(u) \ge 0$ and $-1 \le u \le 1$ so we find that, like in the
spherical pendulum and central force problem, the system is confined to lie between
the two roots of $f(u)$.


There are three possibilities for the motion depending on the sign of $\dot\phi$ at the two
roots $u = u_1$ and $u = u_2$ as determined by (3.74). These are

• $\dot\phi > 0$ at both $u = u_1$ and $u = u_2$
• $\dot\phi > 0$ at $u = u_1$, but $\dot\phi < 0$ at $u = u_2$
• $\dot\phi > 0$ at $u = u_1$ and $\dot\phi = 0$ at $u = u_2$

The different paths of the top corresponding to these three possibilities are shown in
figure 44. Motion in $\phi$ is called precession while motion in $\theta$ is known as nutation.
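
The qualitative picture can be made concrete by finding the two turning points numerically and checking the sign of $\dot\phi$ there. The sketch below scans $f(u)$ of (3.73) for sign changes on $(-1, 1)$, refines each root by bisection, and classifies the motion using (3.74). The parameter values are invented for illustration, $u_1 < u_2$ is taken to mean the lower and upper roots, and the classification assumes the precession is positive at $u_1$, as in the list above.

```python
def f(u, a, b, alpha, beta):
    """The cubic f(u) of (3.73)."""
    return (1 - u * u) * (alpha - beta * u) - (b - a * u) ** 2

def turning_points(a, b, alpha, beta, samples=2000, tol=1e-12):
    """Roots of f in (-1, 1), found by scanning for sign changes and bisecting."""
    roots = []
    grid = [-1 + 2 * k / samples for k in range(samples + 1)]
    for lo, hi in zip(grid, grid[1:]):
        flo = f(lo, a, b, alpha, beta)
        fhi = f(hi, a, b, alpha, beta)
        if flo * fhi < 0:
            while hi - lo > tol:
                mid = 0.5 * (lo + hi)
                fmid = f(mid, a, b, alpha, beta)
                if flo * fmid <= 0:
                    hi = mid
                else:
                    lo, flo = mid, fmid
            roots.append(0.5 * (lo + hi))
    return roots

def phi_dot(u, a, b):
    """Precession rate from (3.74)."""
    return (b - a * u) / (1 - u * u)

def classify(rate_at_u1, rate_at_u2, tol=1e-9):
    """Return 1, 2 or 3, matching the order of the three possibilities listed above."""
    if abs(rate_at_u2) < tol:
        return 3
    return 1 if rate_at_u1 * rate_at_u2 > 0 else 2

# Illustrative (hypothetical) parameter values.
a, b, alpha, beta = 2.0, 1.5, 1.5, 1.0
roots = turning_points(a, b, alpha, beta)
u1, u2 = roots
kind = classify(phi_dot(u1, a, b), phi_dot(u2, a, b))
print(f"turning points: u1 = {u1:.4f}, u2 = {u2:.4f}")
print(f"motion of type {kind}")
```

With these numbers the precession reverses direction at the upper turning point, which is the looping motion of the second kind.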


3.6.1 Letting the Top Go


The last of these three motions is not as unlikely as it may first appear. Suppose
we spin the top and let it go at some angle $\theta$. What happens? We have the initial
conditions

$$\dot\theta_{t=0} = 0 \;\Rightarrow\; f(u_{t=0}) = 0 \;\Rightarrow\; u_{t=0} = u_2$$
$$\text{and}\quad \dot\phi_{t=0} = 0 \;\Rightarrow\; b - au_{t=0} = 0 \;\Rightarrow\; u_{t=0} = \frac{b}{a} \qquad (3.77)$$

Remember also that the quantity

$$p_\phi = I_1\dot\phi\sin^2\theta + I_3\omega_3\cos\theta = I_3\omega_3\cos\theta_{t=0} \qquad (3.78)$$

is a constant of motion. We now have enough information to figure out the qualitative
motion of the top. Firstly, it starts to fall under the influence of gravity, so $\theta$ increases.
But as the top falls, $\dot\phi$ must turn and increase in order to keep $p_\phi$ constant. Moreover,
we also see that the direction of the precession $\dot\phi$ must be in the same direction as the
spin $\omega_3$ itself. What we get is motion of the third kind.


3.6.2 Uniform Precession 


Can we make the top precess without bobbing up and down? i.e. with $\dot\theta = 0$ and $\dot\phi$
constant. For this to happen, we would need the function $f(u)$ to have a single root $u_0$
lying in the physical range $-1 \le u_0 \le +1$. This root must satisfy,

$$f(u_0) = (1 - u_0^2)(\alpha - \beta u_0) - (b - au_0)^2 = 0 \qquad (3.79)$$
$$\text{and}\quad f'(u_0) = -2u_0(\alpha - \beta u_0) - \beta(1 - u_0^2) + 2a(b - au_0) = 0$$

Combining these, we find $\tfrac12\beta = a\dot\phi - \dot\phi^2 u_0$. Substituting the definitions
$I_1 a = I_3\omega_3$ and $\beta = 2Mgl/I_1$ into this expression, we find

$$Mgl = \dot\phi\,(I_3\omega_3 - I_1\dot\phi\cos\theta_0) \qquad (3.80)$$

The interpretation of this equation is as follows: for a fixed value of $\omega_3$ (the spin of
the top) and $\theta_0$ (the angle at which you let it go), we need to give exactly the right
push $\dot\phi$ to make the top spin without bobbing. In fact, since equation (3.80) is quadratic
in $\dot\phi$, there are two frequencies with which the top can precess without bobbing.

Figure 45:

Of course, these “slow” and “fast” precessions only exist if equation (3.80) has any
solutions at all. Since it is quadratic, this is not guaranteed, but requires

$$\omega_3 > \frac{2}{I_3}\sqrt{MglI_1\cos\theta_0} \qquad (3.81)$$

Figure 46:

So we see that, for a given $\theta_0$, the top has to be spinning fast enough in order to have
uniform solutions. What happens if it’s not spinning fast enough? Well, the top falls over!
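
The two precession rates and the spin threshold can be computed directly from (3.80) and (3.81). The sketch below solves the quadratic for $\dot\phi$ and returns the slow and fast roots, or nothing when the spin is too small. The top's mass, size and moments of inertia are made-up numbers in SI units, and the code assumes a tilt with $\cos\theta_0 > 0$ so that the quadratic does not degenerate.

```python
import math

def uniform_precession_rates(M, g, l, I1, I3, spin, theta0):
    """Solve Mgl = phidot * (I3*spin - I1*phidot*cos(theta0)) for phidot.

    Returns (slow, fast), or None if the top is not spinning fast enough.
    Assumes cos(theta0) > 0.
    """
    A = I1 * math.cos(theta0)
    B = -I3 * spin
    C = M * g * l
    disc = B * B - 4 * A * C
    if disc < 0:
        return None
    root = math.sqrt(disc)
    return tuple(sorted(((-B - root) / (2 * A), (-B + root) / (2 * A)), key=abs))

def minimum_spin(M, g, l, I1, I3, theta0):
    """The threshold of (3.81)."""
    return 2.0 * math.sqrt(M * g * l * I1 * math.cos(theta0)) / I3

# Hypothetical toy top.
M, g, l = 0.1, 9.81, 0.05
I1, I3 = 2e-4, 1e-4
theta0 = math.radians(30)

rates = uniform_precession_rates(M, g, l, I1, I3, spin=200.0, theta0=theta0)
too_slow = uniform_precession_rates(M, g, l, I1, I3, spin=50.0, theta0=theta0)
threshold = minimum_spin(M, g, l, I1, I3, theta0)

print(f"slow and fast precession at spin 200 rad/s: {rates[0]:.3f}, {rates[1]:.3f} rad/s")
print(f"minimum spin for uniform precession: {threshold:.1f} rad/s")
print(f"solutions at spin 50 rad/s: {too_slow}")
```

For a rapidly spinning top the slow root is close to $Mgl/I_3\omega_3$, the familiar gyroscope result.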
Figure 47: The function f(u) for the stable and unstable sleeping top. (Panels: The Stable
Sleeping Top, The Unstable Sleeping Top.)


3.6.3 The Sleeping Top 


Suppose we start the top spinning in an upright position, with

$$\theta = \dot\theta = 0 \qquad (3.82)$$

When it spins upright, it is called a sleeping top. The question we want to answer is:
will it stay there? Or will it fall over? From (3.73), we see that the function $f(u)$ must
have a root at $\theta = 0$, or $u = +1$: $f(1) = 0$. From the definitions (3.66) and (3.72), we
can check that $a = b$ and $\alpha = \beta$ in this situation and $f(u)$ actually has a double zero
at $u = +1$,

$$f(u) = (1 - u)^2\left(\alpha(1 + u) - a^2\right) \qquad (3.83)$$

The second root of $f(u)$ is at $u_2 = a^2/\alpha - 1$. There are two possibilities


1: $u_2 > 1$ or $\omega_3^2 > 4I_1Mgl/I_3^2$. In this case, the graph of $f(u)$ is drawn first in
figure 47. This motion is stable: if we perturb the initial conditions slightly, we will
perturb the function $f(u)$ slightly, but the physical condition that we must restrict to
the regime $f(u) > 0$ means that the motion will continue to be trapped near $u = 1$.


2: $u_2 < 1$ or $\omega_3^2 < 4I_1Mgl/I_3^2$. In this case, the function $f(u)$ looks like the second
figure of 47. Now the top is unstable; slight changes in the initial condition allow a
large excursion.


In practice, the top spins upright until it is slowed by friction to $I_3\omega_3 = 2\sqrt{I_1Mgl}$,
at which point it starts to fall and precess.
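
The stability criterion is simple to evaluate. This sketch computes the second root $u_2 = a^2/\alpha - 1$ of (3.83) and compares it with 1, and separately evaluates the critical spin $2\sqrt{I_1Mgl}/I_3$. The physical parameters are invented for illustration and the function names are hypothetical.

```python
import math

def second_root(M, g, l, I1, I3, spin):
    """u2 = a^2 / alpha - 1 for the upright top, where alpha = beta = 2Mgl / I1."""
    a = I3 * spin / I1
    alpha = 2.0 * M * g * l / I1
    return a * a / alpha - 1.0

def sleeping_top_is_stable(M, g, l, I1, I3, spin):
    """Stable when the second root of f(u) lies above u = 1."""
    return second_root(M, g, l, I1, I3, spin) > 1.0

def critical_spin(M, g, l, I1, I3):
    """Spin at which I3 * w3 = 2 * sqrt(I1 * M * g * l)."""
    return 2.0 * math.sqrt(I1 * M * g * l) / I3

# Hypothetical toy top in SI units.
M, g, l = 0.1, 9.81, 0.05
I1, I3 = 2e-4, 1e-4

w_crit = critical_spin(M, g, l, I1, I3)
print(f"critical spin: {w_crit:.2f} rad/s")
for factor in (0.99, 1.01):
    stable = sleeping_top_is_stable(M, g, l, I1, I3, factor * w_crit)
    print(f"spin = {factor:.2f} x critical: stable = {stable}")
```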


3.6.4 The Precession of the Equinox 


The Euler angles for the earth are drawn in figure 48. The earth’s spin axis is tilted at
an angle of $\theta = 23.5°$ to the normal to the plane of its orbit around the sun (this plane
is known as the plane of the ecliptic).

Figure 48: The precession of the earth.

The spin of the earth is $\omega_3 = (\text{day})^{-1}$. This causes the earth to bulge at the equator
so it is no longer a sphere, but rather a symmetric top. In turn, this allows the moon
and sun to exert a torque on the earth which produces a precession $\dot\phi$. Physically this
means that the direction in which the north pole points traces a circle in the sky and
what we currently call the “pole star” will no longer be so in several thousand years’ time.
It turns out that this precession is “retrograde” i.e. opposite to the direction of the
spin.


One can calculate the precession $\dot\phi$ of the earth due to the moon and sun using the
techniques described in the chapter. But the calculation is rather long and we won’t go
over it in this course. (See the book by Hand and Finch if you’re interested). Instead,
we will use a different technique to calculate the precession of the earth: astrology.²


To compute the precession of the earth, the first fact we need to know is that Jesus 
was born in the age of Pisces. This doesn’t mean that Jesus looked up Pisces in his 
daily horoscope (while scholars are divided over the exact date of his birth, he seems to 
exhibit many traits of a typical Capricorn) but rather refers to the patch of the sky in 
which the sun appears during the first day of spring. Known in astronomical terms as 
the “vernal equinox”, this day of the year is defined by the property that the sun sits 
directly above the equator at midday. As the earth precesses, this event takes place 
at a slightly different point in its orbit each year, with a slightly different backdrop of 
stars as a result. The astrological age is defined to be the background constellation in 
which the sun rises during vernal equinox. 


² I learnt about this fact from John Baez’ website where you can find lots of well written explanations
of curiosities in mathematical physics: http://math.ucr.edu/home/baez/README.html.


It is easy to remember that Jesus was born in the age of Pisces since the fish was
used as an early symbol for Christianity. The next fact that we need to know is
that we’re currently entering the age of Aquarius (which anyone who has seen the
musical Hair will know). So we managed to travel backwards one house of the zodiac
in 2,000 years. We’ve got to make it around 12 in total, giving us a precession time of
2,000 × 12 = 24,000 years. The actual value of the precession period is 25,700 years. Our
calculation is pretty close considering the method!


The earth also undergoes other motion. The value of $\theta$ varies from 22.1° to 24.5°
over a period of 41,000 years, mostly due to the effects of the other planets. These also
affect the eccentricity of the orbit over a period of 105,000 years.


3.7 The Motion of Deformable Bodies 


Take a lively cat. (Not one that’s half dead like Schrödinger’s). Hold it upside down and
drop it. The cat will twist its body and land sprightly on its feet. Yet it doesn’t do this
by pushing against anything and its angular momentum is zero throughout. If the cat
were rigid, such motion would be impossible since a change in orientation for a rigid
body necessarily requires non-vanishing angular momentum. But the cat isn’t rigid
(indeed, it can be checked that dead cats are unable to perform this feat) and bodies
that can deform are able to reorient themselves without violating the conservation of
angular momentum. In this section we’ll describe some of the beautiful mathematics
that lies behind this. I should warn you that this material is somewhat more advanced
than the motion of rigid bodies. The theory described below was first developed in the
late 1980s in order to understand how micro-organisms swim³.


3.7.1 Kinematics 


We first need to describe the configuration space $C$ of a deformable body. We factor
out translations by insisting that all bodies have the same center of mass. Then the
configuration space $C$ is the space of all shapes with some orientation.


Rotations act naturally on the space $C$ (they simply rotate each shape). This allows
us to define the smaller shape space $\tilde C$ so that any two configurations in $C$ which are
related by a rotation are identified in $\tilde C$. In other words, any two objects that have the
same shape, but different orientation, are described by different points in $C$, but the
same point in $\tilde C$. Mathematically, we say $\tilde C \cong C/SO(3)$.


³ See A. Shapere and F. Wilczek, “Geometry of Self-Propulsion at Low Reynolds Number”, J. Fluid
Mech. 198 557 (1989). This is the same Frank Wilczek who won the 2004 Nobel prize for his work
on quark interactions.


Figure 49: Three possible shapes of a deformable object.


We can describe this in more detail for a body consisting of $N$ point masses, each with
position $\mathbf r_i$. Unlike in section 3.1, we do not require that the distances between particles
are fixed, i.e. $|\mathbf r_i - \mathbf r_j| \neq$ constant. (However, there may still be some restrictions on
the $\mathbf r_i$). The configuration space $C$ is the space of all possible configurations $\mathbf r_i$. For
each different shape in $C$, we pick a representative $\tilde{\mathbf r}_i$ with some fixed orientation. It
doesn’t matter what representative we choose, just as long as we pick one. These
variables $\tilde{\mathbf r}_i$ are coordinates on the shape space $\tilde C$. For each $\mathbf r_i \in C$, we can always find
a rotation matrix $R \in SO(3)$ such that

$$\mathbf r_i = R\,\tilde{\mathbf r}_i \qquad (3.84)$$

As in section 3.1, we can always extend this to continuous bodies. In this case, the
configuration space $C$ and the shape space $\tilde C$ may be infinite dimensional. Examples of
different shapes for a continuously deformable body are shown in figure 49.


We want to understand how an object rotates as it changes shape keeping its angular
momentum fixed (for example, keeping $\mathbf L = 0$ throughout). The first thing to note is
that we can’t really talk about the rotation between objects of different shapes. (How
would you say that the third object in figure 49 is rotated with respect to the first
or the second?). Instead, we should think of an object moving through a sequence
of shapes before returning to its initial shape. We can then ask if there’s been a net
rotation. As the object moves through its sequence of shapes, the motion is described
by a time dependent $\tilde{\mathbf r}_i(t)$, while the corresponding change through the configuration
space is

$$\mathbf r_i(t) = R(t)\,\tilde{\mathbf r}_i(t) \qquad (3.85)$$

where the 3 × 3 rotation matrix $R(t)$ describes the necessary rotation to go from our
fixed orientation of the shape $\tilde{\mathbf r}$ to the true orientation. As in section 3.1.1, we can define
the 3 × 3 anti-symmetric matrix that describes the instantaneous angular velocity of
the object. In fact, it will for once prove more useful to work with the “convective
angular velocity” defined around equation (3.10)

$$\Omega = R^{-1}\frac{dR}{dt} \qquad (3.86)$$

This angular velocity is non-zero due to the changing shape of the object, rather than
the rigid rotation that we saw before. Let’s do a quick change of notation and write
coordinates on the shape space $\tilde C$ as $x^A$, with $A = 1,\dots,3N$ instead of in vector
notation $\tilde{\mathbf r}_i$, with $i = 1,\dots,N$. Then, since $\Omega$ is linear in time derivatives, we can write

$$\Omega = \Omega_A(x)\,\dot x^A \qquad (3.87)$$

The component $\Omega_A(x)$ is the 3 × 3 angular velocity matrix induced if the shape changes
from $x^A$ to $x^A + \delta x^A$. It is independent of time: all the time dependence sits in
the $\dot x^A$ factor which tells us how the shape is changing. The upshot is that for each
shape $x \in \tilde C$, we have a 3 × 3 anti-symmetric matrix $\Omega_A$ associated to each of the
$A = 1,\dots,3N$ directions in which the shape can change.


However, there is an ambiguity in defining the angular velocity $\Omega$. This comes about
because of our arbitrary choice of reference orientation when we picked a representative
$\tilde{\mathbf r}_i \in C$ for each shape. We could quite easily have picked a different orientation,

$$\tilde{\mathbf r}_i \to S(x^A)\,\tilde{\mathbf r}_i \qquad (3.88)$$

where $S(x^A)$ is a rotation that, as the notation suggests, can vary for each shape $x^A$. If
we pick this new set of representative orientations, then the rotation matrix $R$ defined
in (3.85) changes: $R(t) \to R(t)\,S^{-1}(x^A)$. Equation (3.86) then tells us that the angular
velocity also changes as

$$\Omega_A \to S\,\Omega_A\,S^{-1} + S\,\frac{\partial S^{-1}}{\partial x^A} \qquad (3.89)$$

This ambiguity is related to the fact that we can’t define the meaning of rotation
between two different shapes. Nonetheless, we will see shortly that when we come to
compute the net rotation of the same shape, this ambiguity will disappear, as it must.
Objects such as $\Omega_A$ which suffer an ambiguity of form (3.89) are extremely important
in modern physics and geometry. They are known as non-abelian gauge potentials to
physicists, or as connections to mathematicians.
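
The step from (3.86) to (3.89) takes only a couple of lines. The derivation below is a sketch that assumes nothing beyond the definitions already given: $\Omega = R^{-1}\dot R$, the replacement $R \to R\,S^{-1}$, and the chain rule for the time derivative of $S^{-1}(x^A(t))$. The primed symbol $\Omega'$ is introduced here only to label the transformed quantity.

```latex
\begin{align*}
\Omega = R^{-1}\dot R \;\longrightarrow\;
\Omega' &= (R S^{-1})^{-1}\,\frac{d}{dt}\bigl(R S^{-1}\bigr) \\
        &= S R^{-1}\Bigl(\dot R\,S^{-1} + R\,\frac{dS^{-1}}{dt}\Bigr) \\
        &= S\,\Omega\,S^{-1} + S\,\frac{\partial S^{-1}}{\partial x^A}\,\dot x^A .
\end{align*}
% Writing \Omega = \Omega_A \dot x^A and comparing coefficients of \dot x^A:
\[
\Omega_A \;\longrightarrow\; S\,\Omega_A\,S^{-1} + S\,\frac{\partial S^{-1}}{\partial x^A}
\]
```

The inhomogeneous second term is what distinguishes a gauge potential from an ordinary matrix-valued field.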
3.7.2 Dynamics


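So far we’ve learnt how to describe the angular velocity $\Omega$ of a deformable object. The
next step is to see how to calculate $\Omega$. We’ll now show that, up to the ambiguity
described in (3.89), the angular velocity $\Omega$ is specified by the requirement that the
angular momentum $\mathbf L$ of the object is zero.

$$\mathbf L = \sum_i m_i\,\mathbf r_i \times \dot{\mathbf r}_i
 = \sum_i m_i\left[(R\tilde{\mathbf r}_i)\times(\dot R\tilde{\mathbf r}_i) + (R\tilde{\mathbf r}_i)\times(R\dot{\tilde{\mathbf r}}_i)\right] = 0 \qquad (3.90)$$

In components this reads

$$L_a = \epsilon_{abc}\sum_i m_i\left[R_{bd}\dot R_{ce}(\tilde r_i)_d(\tilde r_i)_e + R_{bd}R_{ce}(\tilde r_i)_d(\dot{\tilde r}_i)_e\right] = 0 \qquad (3.91)$$

The vanishing $\mathbf L = 0$ is enough information to determine the following result:


Claim: The 3 × 3 angular velocity matrix $\Omega_{ab} = R^{-1}_{ac}\dot R_{cb}$ is given by

$$\Omega_{ab} = \epsilon_{abc}\,I^{-1}_{cd}\,\tilde L_d \qquad (3.92)$$

where $I$ is the instantaneous inertia tensor of the shape described by $\tilde{\mathbf r}_i$,

$$I_{ab} = \sum_i m_i\left((\tilde{\mathbf r}_i\cdot\tilde{\mathbf r}_i)\,\delta_{ab} - (\tilde r_i)_a(\tilde r_i)_b\right) \qquad (3.93)$$

and $\tilde L_a$ is the apparent angular momentum

$$\tilde L_a = \epsilon_{abc}\sum_i m_i\,(\tilde r_i)_b(\dot{\tilde r}_i)_c \qquad (3.94)$$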

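Proof: We start by multiplying $L_a$ by $\epsilon_{afg}$. We need to use the fact that if we
multiply two $\epsilon$-symbols, we have $\epsilon_{abc}\epsilon_{afg} = (\delta_{bf}\delta_{cg} - \delta_{bg}\delta_{cf})$. Then

$$\epsilon_{afg}L_a = \sum_i m_i\bigl[R_{fd}\dot R_{ge}(\tilde r_i)_d(\tilde r_i)_e - R_{gd}\dot R_{fe}(\tilde r_i)_d(\tilde r_i)_e
 - R_{gd}R_{fe}(\tilde r_i)_d(\dot{\tilde r}_i)_e + R_{fd}R_{ge}(\tilde r_i)_d(\dot{\tilde r}_i)_e\bigr] = 0 \qquad (3.95)$$

Now multiply by $R_{fb}R_{gc}$. Since $R$ is orthogonal, we know that $R_{fb}R_{fd} = \delta_{bd}$ which,
after contracting a bunch of indices, gives us

$$R_{fb}R_{gc}\,\epsilon_{afg}L_a = \sum_i m_i\bigl[(\tilde r_i)_b(\dot{\tilde r}_i)_c - (\tilde r_i)_c(\dot{\tilde r}_i)_b - \Omega_{bd}(\tilde r_i)_c(\tilde r_i)_d + \Omega_{cd}(\tilde r_i)_b(\tilde r_i)_d\bigr] = 0$$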


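This is almost in the form that we want, but the indices aren’t quite contracted in the
right manner to reproduce (3.92). One can try to play around to get the indices working
right, but at this stage it’s just as easy to expand out the components explicitly. For
example, we can look at

$$\tilde L_1 = \sum_i m_i \left[ (\tilde r_i)_2(\dot{\tilde r}_i)_3 - (\tilde r_i)_3(\dot{\tilde r}_i)_2 \right]$$
$$\phantom{\tilde L_1} = \sum_i m_i \left[ \Omega_{21}(\tilde r_i)_3(\tilde r_i)_1 + \Omega_{23}(\tilde r_i)_3(\tilde r_i)_3 - \Omega_{31}(\tilde r_i)_2(\tilde r_i)_1 - \Omega_{32}(\tilde r_i)_2(\tilde r_i)_2 \right]$$
$$\phantom{\tilde L_1} = I_{11}\Omega_{23} + I_{12}\Omega_{31} + I_{13}\Omega_{12} = \frac{1}{2}\epsilon_{abc}\, I_{1a}\,\Omega_{bc} \qquad (3.96)$$

where the first equality is the definition of $\tilde L_1$, while the second equality uses our result
above, and the third equality uses the definition of $I$ given in (3.93). There are two
similar equations, which are summarised in the formula

$$\tilde L_a = \frac{1}{2}\epsilon_{bcd}\, I_{ab}\,\Omega_{cd} \qquad (3.97)$$

Multiplying both sides by $I^{-1}$ gives us precisely the claimed result (3.92). This concludes the proof.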
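As a sanity check on the claim, here is a minimal numerical sketch in Python. It builds $I$ and $\tilde L$ from (3.93) and (3.94), forms $\Omega$ from (3.92), and confirms that the total angular momentum vanishes at an instant where we take $R = 1$. The masses, positions and shape velocities are made-up test data, and the helper names are ours rather than anything from the notes.

```python
# Sketch: check Omega_ab = eps_abc (I^{-1})_cd Ltilde_d on invented test data.

def cross(u, v):
    return [u[1]*v[2] - u[2]*v[1],
            u[2]*v[0] - u[0]*v[2],
            u[0]*v[1] - u[1]*v[0]]

def det3(m):
    return (m[0][0]*(m[1][1]*m[2][2] - m[1][2]*m[2][1])
          - m[0][1]*(m[1][0]*m[2][2] - m[1][2]*m[2][0])
          + m[0][2]*(m[1][0]*m[2][1] - m[1][1]*m[2][0]))

def solve3(m, b):
    """Solve m x = b for a 3x3 matrix using Cramer's rule."""
    d = det3(m)
    x = []
    for k in range(3):
        mk = [[b[i] if j == k else m[i][j] for j in range(3)] for i in range(3)]
        x.append(det3(mk) / d)
    return x

def inertia_tensor(masses, pos):
    """I_ab = sum_i m_i (|r_i|^2 delta_ab - (r_i)_a (r_i)_b), as in (3.93)."""
    I = [[0.0]*3 for _ in range(3)]
    for m, r in zip(masses, pos):
        r2 = sum(c*c for c in r)
        for a in range(3):
            for b in range(3):
                I[a][b] += m * ((r2 if a == b else 0.0) - r[a]*r[b])
    return I

def apparent_L(masses, pos, vel):
    """Ltilde = sum_i m_i r_i x rdot_i, as in (3.94)."""
    L = [0.0]*3
    for m, r, v in zip(masses, pos, vel):
        c = cross(r, v)
        L = [L[k] + m*c[k] for k in range(3)]
    return L

def angular_velocity_matrix(masses, pos, vel):
    """Omega_ab = eps_abc x_c with x = I^{-1} Ltilde, as in (3.92)."""
    x = solve3(inertia_tensor(masses, pos), apparent_L(masses, pos, vel))
    return [[0.0, x[2], -x[1]],
            [-x[2], 0.0, x[0]],
            [x[1], -x[0], 0.0]]

def total_L(masses, pos, vel, Omega):
    """Angular momentum (3.90) at an instant with R = 1, so rdot = Omega r + shape velocity."""
    L = [0.0]*3
    for m, r, v in zip(masses, pos, vel):
        rdot = [sum(Omega[a][b]*r[b] for b in range(3)) + v[a] for a in range(3)]
        c = cross(r, rdot)
        L = [L[k] + m*c[k] for k in range(3)]
    return L

masses = [1.0, 2.0, 1.5, 0.5]
pos = [[1.0, 0.0, 0.2], [-0.5, 1.0, 0.0], [0.0, -1.0, 0.7], [0.3, 0.4, -1.0]]
vel = [[0.0, 0.3, 0.0], [0.1, 0.0, -0.2], [-0.4, 0.0, 0.1], [0.0, 0.2, 0.2]]
Omega = angular_velocity_matrix(masses, pos, vel)
L_total = total_L(masses, pos, vel, Omega)
print(L_total)
```

The printed components should vanish up to rounding error, which is just the statement that (3.92) solves $\mathbf L = 0$.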


To summarise: a system with no angular momentum that can twist and turn and 
change its shape has an angular velocity (3.92) where $\tilde{\mathbf r}_i(t)$ is the path it chooses to take
through the space of shapes. This is a nice formula. But what do we do with it? We 
want to compute the net rotation R as the body moves through a sequence of shapes 
and returns to its starting point at a time T later. This is given by solving (3.86) for 
R. The way to do this was described in section 3.1.2. We use path ordered exponentials 


$$R = P\exp\left(\int_0^T \Omega(t)\,dt\right) = P\exp\left(\oint \Omega_A\,dx^A\right) \qquad (3.98)$$

The path ordering symbol $P$ puts all matrices evaluated at later times to the right.
(This differs from the ordering in section 3.1.2 where we put later matrices to the left.
The difference arises because we’re working with the angular velocity $\Omega = R^{-1}\dot R$ instead
of the angular velocity $\omega = \dot R R^{-1}$). In the second equality above, we’ve written the
exponent as an integral around a closed path in shape space. Here time has dropped 
out. This tells us an important fact: it doesn’t matter how quickly we perform the 
change of shapes — the net rotation of the object will be the same. 


In particle physics language, the integral in (3.98) is called a “Wilson loop”. We can 
see how the rotation fares under the ambiguity (3.87). After some algebra, you can 
find that the net rotation $R$ of an object with shape $x^A$ is changed by

$$R \to S(x^A)\,R\,S(x^A)^{-1} \qquad (3.99)$$

This is as it should be: the $S^{-1}$ takes the shape back to our initial choice of standard
orientation; the matrix R is the rotation due to the change in shape; finally S puts us 
back to the new, standard orientation. So we see that even though the definition of the 
angular velocity is plagued with ambiguity, when we come to ask physically meaningful 
questions — such as how much has a shape rotated — the ambiguity disappears. 
However, if we ask nonsensical questions — such as the rotation between two different 
shapes — then the ambiguity looms large. In this manner, the theory contains a rather 
astonishing new ingredient: it lets us know what are the sensible questions to ask! 
Quantities for which the ambiguity (3.87) vanishes are called gauge invariant. 


In general, it’s quite hard to explicitly compute the integral (3.98). One case where
it is possible is for infinitesimal changes of shape. Suppose we start with a particular
shape $x^A_0$, and move infinitesimally in a loop in shape space:

$$x^A(t) = x^A_0 + \alpha^A(t) \qquad (3.100)$$

Then we can Taylor expand our angular velocity components,

$$\Omega_A(x(t)) = \Omega_A(x_0) + \left.\frac{\partial \Omega_A}{\partial x^B}\right|_{x_0} \alpha^B \qquad (3.101)$$

Expanding out the rotation matrix (3.98) and taking care with the ordering, one can
show that

$$R = 1 + \frac{1}{2} F_{AB} \oint \alpha^A \dot\alpha^B\,dt + O(\alpha^3) = 1 + \frac{1}{2} \int F_{AB}\,dA^{AB} + O(\alpha^3) \qquad (3.102)$$

where $F_{AB}$ is anti-symmetric in the shape space indices $A$ and $B$, and is a 3 × 3 matrix
(the $a, b = 1, 2, 3$ indices have been suppressed) given by

$$F_{AB} = \frac{\partial \Omega_A}{\partial x^B} - \frac{\partial \Omega_B}{\partial x^A} + [\Omega_A, \Omega_B] \qquad (3.103)$$

It is known as the field strength to physicists (or the curvature to mathematicians). It is
evaluated on the initial shape $x^A_0$. The second equality in (3.102) gives the infinitesimal
rotation as the integral of the field strength over the area traversed in shape space. This
field strength contains all the information one needs to know about the infinitesimal
rotations of objects induced by changing their shape.


One of the nicest things about the formalism described above is that it mirrors very 
closely the mathematics needed to describe the fundamental laws of nature, such as 
the strong and weak nuclear forces and gravity. They are all described by “non-abelian 
gauge theories”, with an object known as the gauge potential (analogous to $\Omega_A$) and
an associated field strength.

4. The Hamiltonian Formalism


We’ll now move onto the next level in the formalism of classical mechanics, due initially 
to Hamilton around 1830. While we won’t use Hamilton’s approach to solve any further 
complicated problems, we will use it to reveal much more of the structure underlying 
classical dynamics. If you like, it will help us understand what questions we should
ask. 


4.1 Hamilton’s Equations 


Recall that in the Lagrangian formulation, we have the function $L(q_i, \dot q_i, t)$ where $q_i$
($i = 1, \ldots, n$) are $n$ generalised coordinates. The equations of motion are

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_i}\right) - \frac{\partial L}{\partial q_i} = 0 \qquad (4.1)$$

These are $n$ 2nd order differential equations which require $2n$ initial conditions, say
$q_i(t = 0)$ and $\dot q_i(t = 0)$. The basic idea of Hamilton’s approach is to try and place $q_i$
and $p_i$ on a more symmetric footing. More precisely, we’ll work with the $n$ generalised
momenta that we introduced in section 2.3.3,

$$p_i = \frac{\partial L}{\partial \dot q_i}, \qquad i = 1, \ldots, n \qquad (4.2)$$

so $p_i = p_i(q_j, \dot q_j, t)$. This coincides with what we usually call momentum only if we
work in Cartesian coordinates (so the kinetic term is $\frac{1}{2} m_i \dot q_i^2$). If we rewrite Lagrange’s
equations (4.1) using the definition of the momentum (4.2), they become

$$\dot p_i = \frac{\partial L}{\partial q_i} \qquad (4.3)$$

The plan will be to eliminate $\dot q_i$ in favour of the momenta $p_i$, and then to place $q_i$ and
$p_i$ on equal footing.


Figure 50: Motion in configuration space on the left, and in phase space on the right.

Let’s start by thinking pictorially. Recall that $\{q_i\}$ defines a point in $n$-dimensional
configuration space $C$. Time evolution is a path in $C$. However, the state of the system
is defined by $\{q_i\}$ and $\{p_i\}$ in the sense that this information will allow us to determine
the state at all times in the future. The pair $\{q_i, p_i\}$ defines a point in $2n$-dimensional
phase space. Note that since a point in phase space is sufficient to determine the future
evolution of the system, paths in phase space can never cross. We say that evolution
is governed by a flow in phase space.


An Example: The Pendulum 


Consider a simple pendulum. The configuration space is clearly a circle, $S^1$, parameterised
by an angle $\theta \in [-\pi, \pi)$. The phase space of the pendulum is a cylinder $\mathbf R \times S^1$,
with the $\mathbf R$ factor corresponding to the momentum. We draw this by flattening out the
cylinder. The two different types of motion are clearly visible in the phase space flows.

Figure 51: Flows in the phase space of a pendulum. (Labels recoverable from the figure: vertical axis $p_\theta$; “Rotating anti-clockwise”; “(libration)”.)


For small $\theta$ and small momentum, the pendulum oscillates back and forth, motion
which appears as an ellipse in phase space. But for large momentum, the pendulum
swings all the way around, which appears as lines wrapping around the $S^1$ of phase
space. Separating these two different motions is the special case where the pendulum
starts upright, falls, and just makes it back to the upright position. This curve in phase
space is called the separatrix.
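The two types of motion can be told apart by comparing the energy with its value on the separatrix. The sketch below assumes the standard pendulum Hamiltonian $H = p^2/2ml^2 - mgl\cos\theta$ (not written down in the text above) with illustrative parameter values, and integrates Hamilton's equations with a simple leapfrog scheme, which is our choice of integrator.

```python
import math

# Illustrative parameters (assumptions, not taken from the text).
MASS, LENGTH, GRAVITY = 1.0, 1.0, 9.81

def hamiltonian(theta, p):
    """Assumed pendulum Hamiltonian H = p^2/(2 m l^2) - m g l cos(theta)."""
    return p * p / (2 * MASS * LENGTH**2) - MASS * GRAVITY * LENGTH * math.cos(theta)

def classify(theta, p):
    """Compare the energy with the separatrix energy H(pi, 0) = m g l."""
    energy = hamiltonian(theta, p)
    separatrix_energy = MASS * GRAVITY * LENGTH
    if energy < separatrix_energy:
        return "libration"
    if energy > separatrix_energy:
        return "rotation"
    return "separatrix"

def evolve(theta, p, t_total, n_steps=20000):
    """Leapfrog integration of Hamilton's equations; theta is left unwrapped.

    Returns the final theta, the final p and the largest |theta| seen on the way."""
    dt = t_total / n_steps
    max_abs_theta = abs(theta)
    for _ in range(n_steps):
        p -= 0.5 * dt * MASS * GRAVITY * LENGTH * math.sin(theta)  # half kick: p_dot = -dH/dtheta
        theta += dt * p / (MASS * LENGTH**2)                      # drift: theta_dot = dH/dp
        p -= 0.5 * dt * MASS * GRAVITY * LENGTH * math.sin(theta)  # second half kick
        max_abs_theta = max(max_abs_theta, abs(theta))
    return theta, p, max_abs_theta

small = evolve(0.1, 0.0, 10.0)   # starts inside the separatrix
fast = evolve(0.0, 10.0, 10.0)   # starts outside the separatrix
print(classify(0.1, 0.0), small[2])
print(classify(0.0, 10.0), fast[0])
```

For the first initial condition the angle never strays far from zero, while for the second the unwrapped angle keeps growing as the pendulum winds around the $S^1$.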


4.1.1 The Legendre Transform 


We want to find a function on phase space that will determine the unique evolution
of $q_i$ and $p_i$. This means it should be a function of $q_i$ and $p_i$ (and not of $\dot q_i$) but must
contain the same information as the Lagrangian $L(q_i, \dot q_i, t)$. There is a mathematical
trick to do this, known as the Legendre transform.


To describe this, consider an arbitrary function $f(x, y)$ so that the total derivative is

$$df = \frac{\partial f}{\partial x}\,dx + \frac{\partial f}{\partial y}\,dy \qquad (4.4)$$

Now define a function $g(x, y, u) = ux - f(x, y)$ which depends on three variables, $x$, $y$
and also $u$. If we look at the total derivative of $g$, we have

$$dg = d(ux) - df = u\,dx + x\,du - \frac{\partial f}{\partial x}\,dx - \frac{\partial f}{\partial y}\,dy \qquad (4.5)$$

At this point $u$ is an independent variable. But suppose we choose it to be a specific
function of $x$ and $y$, defined by

$$u(x, y) = \frac{\partial f}{\partial x} \qquad (4.6)$$

Then the term proportional to $dx$ in (4.5) vanishes and we have

$$dg = x\,du - \frac{\partial f}{\partial y}\,dy \qquad (4.7)$$

Or, in other words, $g$ is to be thought of as a function of $u$ and $y$: $g = g(u, y)$. If we
want an explicit expression for $g(u, y)$, we must first invert (4.6) to get $x = x(u, y)$ and
then insert this into the definition of $g$ so that

$$g(u, y) = u\,x(u, y) - f(x(u, y), y) \qquad (4.8)$$

This is the Legendre transform. It takes us from one function $f(x, y)$ to a different function
$g(u, y)$ where $u = \partial f/\partial x$. The key point is that we haven’t lost any information.
Indeed, we can always recover $f(x, y)$ from $g(u, y)$ by noting that

$$\left.\frac{\partial g}{\partial u}\right|_y = x(u, y) \qquad \text{and} \qquad \left.\frac{\partial g}{\partial y}\right|_u = -\frac{\partial f}{\partial y} \qquad (4.9)$$


which assures us that the inverse Legendre transform $f = (\partial g/\partial u)\,u - g$ takes us back
to the original function.

The geometrical meaning of the Legendre transform is captured in the diagram. For fixed $y$, we draw the two
curves $f(x, y)$ and $ux$. For each slope $u$, the value of $g(u)$
is the maximal distance between the two curves. To see
this, note that extremising this distance means

$$\frac{d}{dx}\left(ux - f(x)\right) = 0 \quad \Rightarrow \quad u = \frac{\partial f}{\partial x} \qquad (4.10)$$

Figure 52:

This picture also tells us that we can only apply the Legendre transform to convex functions for which this maximum
exists. Now, armed with this tool, let’s return to dynamics.
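The "maximal distance" picture translates directly into a few lines of code. What follows is only a sketch: it computes $g(u) = \max_x\,(ux - f(x))$ for a convex function of one variable by ternary search on a fixed interval (the interval and the two test functions are our own choices), and then applies the transform twice to check that no information has been lost.

```python
def legendre(f, u, lo=-50.0, hi=50.0, iters=200):
    """Return (g(u), x*) with g(u) = max over x in [lo, hi] of u*x - f(x).

    Assumes f is convex, so that u*x - f(x) has a single maximum on the interval."""
    def gap(x):
        return u * x - f(x)          # vertical distance between the line ux and the curve f
    for _ in range(iters):
        a = lo + (hi - lo) / 3
        b = hi - (hi - lo) / 3
        if gap(a) < gap(b):
            lo = a                   # the maximum lies to the right of a
        else:
            hi = b                   # the maximum lies to the left of b
    x_star = 0.5 * (lo + hi)
    return gap(x_star), x_star

def quadratic(x):
    return 0.5 * x * x               # f = x^2/2 has transform g = u^2/2

def quartic(x):
    return 0.25 * x**4               # f = x^4/4 has transform g = (3/4) u^(4/3)

g_quadratic, x_quadratic = legendre(quadratic, 3.0)
g_quartic, x_quartic = legendre(quartic, 2.0)

# Transforming twice should give back the original function.
f_recovered = legendre(lambda u: legendre(quadratic, u)[0], 1.5)[0]
print(g_quadratic, g_quartic, f_recovered)
```

At the maximum the slope condition $u = \partial f/\partial x$ of (4.10) holds, which is why `x_quadratic` comes out equal to `u` for the quadratic example.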


4.1.2 Hamilton’s Equations 


The Lagrangian $L(q_i, \dot q_i, t)$ is a function of the coordinates $q_i$, their time derivatives $\dot q_i$
and (possibly) time. We define the Hamiltonian to be the Legendre transform of the
Lagrangian with respect to the $\dot q_i$ variables,

$$H(q_i, p_i, t) = \sum_{i=1}^{n} p_i \dot q_i - L(q_i, \dot q_i, t) \qquad (4.11)$$

where $\dot q_i$ is eliminated from the right hand side in favour of $p_i$ by using

$$p_i = \frac{\partial L}{\partial \dot q_i} = p_i(q_j, \dot q_j, t) \qquad (4.12)$$

and inverting to get $\dot q_i = \dot q_i(q_j, p_j, t)$. Now look at the variation of $H$:

$$dH = (dp_i\,\dot q_i + p_i\,d\dot q_i) - \left( \frac{\partial L}{\partial q_i}\,dq_i + \frac{\partial L}{\partial \dot q_i}\,d\dot q_i + \frac{\partial L}{\partial t}\,dt \right) = dp_i\,\dot q_i - \frac{\partial L}{\partial q_i}\,dq_i - \frac{\partial L}{\partial t}\,dt \qquad (4.13)$$

but we know that this can be rewritten as

$$dH = \frac{\partial H}{\partial q_i}\,dq_i + \frac{\partial H}{\partial p_i}\,dp_i + \frac{\partial H}{\partial t}\,dt \qquad (4.14)$$

So we can equate terms. So far this is repeating the steps of the Legendre transform.
The new ingredient that we now add is Lagrange’s equation which reads $\dot p_i = \partial L/\partial q_i$.
We find

$$\dot p_i = -\frac{\partial H}{\partial q_i}, \qquad \dot q_i = \frac{\partial H}{\partial p_i} \qquad (4.15)$$

$$-\frac{\partial L}{\partial t} = \frac{\partial H}{\partial t} \qquad (4.16)$$

These are Hamilton’s equations. We have replaced $n$ 2nd order differential equations by
$2n$ 1st order differential equations for $q_i$ and $p_i$. In practice, for solving problems, this
isn’t particularly helpful. But, as we shall see, conceptually it’s very useful!


4.1.3 Examples 


1) A Particle in a Potential 


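Let’s start with a simple example: a particle moving in a potential in 3-dimensional
space. The Lagrangian is simply

$$L = \frac{1}{2} m \dot{\mathbf r}^2 - V(\mathbf r) \qquad (4.17)$$

We calculate the momentum by taking the derivative with respect to $\dot{\mathbf r}$

$$\mathbf p = \frac{\partial L}{\partial \dot{\mathbf r}} = m \dot{\mathbf r} \qquad (4.18)$$

which, in this case, coincides with what we usually call momentum. The Hamiltonian
is then given by

$$H = \mathbf p \cdot \dot{\mathbf r} - L = \frac{1}{2m}\mathbf p^2 + V(\mathbf r) \qquad (4.19)$$

where, in the end, we’ve eliminated $\dot{\mathbf r}$ in favour of $\mathbf p$ and written the Hamiltonian as a
function of $\mathbf p$ and $\mathbf r$. Hamilton’s equations are simply

$$\dot{\mathbf r} = \frac{\partial H}{\partial \mathbf p} = \frac{1}{m}\mathbf p, \qquad \dot{\mathbf p} = -\frac{\partial H}{\partial \mathbf r} = -\nabla V \qquad (4.20)$$

which are familiar: the first is the definition of momentum in terms of velocity; the
second is Newton’s equation for this system.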


2) A Particle in an Electromagnetic Field 


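We saw in section 2.5.7 that the Lagrangian for a charged particle moving in an electromagnetic
field is

$$L = \frac{1}{2} m \dot{\mathbf r}^2 - e\left(\phi - \dot{\mathbf r} \cdot \mathbf A\right) \qquad (4.21)$$

From this we compute the momentum conjugate to the position

$$\mathbf p = \frac{\partial L}{\partial \dot{\mathbf r}} = m \dot{\mathbf r} + e\mathbf A \qquad (4.22)$$

which now differs from what we usually call momentum by the addition of the vector
potential $\mathbf A$. Inverting, we have

$$\dot{\mathbf r} = \frac{1}{m}(\mathbf p - e\mathbf A) \qquad (4.23)$$

So we calculate the Hamiltonian to be

$$H(\mathbf p, \mathbf r) = \mathbf p \cdot \dot{\mathbf r} - L$$
$$\phantom{H(\mathbf p, \mathbf r)} = \frac{1}{m}\mathbf p \cdot (\mathbf p - e\mathbf A) - \left[ \frac{1}{2m}(\mathbf p - e\mathbf A)^2 - e\phi + \frac{e}{m}(\mathbf p - e\mathbf A) \cdot \mathbf A \right]$$
$$\phantom{H(\mathbf p, \mathbf r)} = \frac{1}{2m}(\mathbf p - e\mathbf A)^2 + e\phi \qquad (4.24)$$

Now Hamilton’s equations read

$$\dot{\mathbf r} = \frac{\partial H}{\partial \mathbf p} = \frac{1}{m}(\mathbf p - e\mathbf A) \qquad (4.25)$$

while the $\dot{\mathbf p} = -\partial H/\partial \mathbf r$ equation is best expressed in terms of components

$$\dot p_a = -\frac{\partial H}{\partial r_a} = -e\frac{\partial \phi}{\partial r_a} + \frac{e}{m}(p_b - eA_b)\frac{\partial A_b}{\partial r_a} \qquad (4.26)$$

To show that this is equivalent to the Lorentz force law requires some rearranging of
the indices, but it’s not too hard.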


An Example of the Example 


Let’s illustrate the dynamics of a particle moving in a magnetic field by looking at a
particular case. Imagine a uniform magnetic field pointing in the $z$-direction: $\mathbf B = (0, 0, B)$.
We can get this from a vector potential $\mathbf B = \nabla \times \mathbf A$ with

$$\mathbf A = (-By, 0, 0) \qquad (4.27)$$

This vector potential isn’t unique: we could choose others related by a gauge transform
as described in section 2.5.7. But this one will do for our purposes. Consider a particle
moving in the $(x, y)$-plane. Then the Hamiltonian for this system is

$$H = \frac{1}{2m}(p_x + eBy)^2 + \frac{1}{2m}p_y^2 \qquad (4.28)$$

From which we have four, first order differential equations which are Hamilton’s equations

$$\dot p_x = 0, \qquad \dot x = \frac{1}{m}(p_x + eBy), \qquad \dot p_y = -\frac{eB}{m}(p_x + eBy), \qquad \dot y = \frac{p_y}{m} \qquad (4.29)$$

If we add these together in the right way, we find that

$$p_y + eBx = a = \text{const.} \qquad (4.30)$$

and

$$p_x = m\dot x - eBy = b = \text{const.} \qquad (4.31)$$

which is easy to solve: we have

$$x = \frac{a}{eB} + R\sin\left(\omega(t - t_0)\right), \qquad y = -\frac{b}{eB} + R\cos\left(\omega(t - t_0)\right) \qquad (4.32)$$

Figure 53:

with $a$, $b$, $R$ and $t_0$ integration constants. So we see that the particle makes circles in
the $(x, y)$-plane with frequency

$$\omega = \frac{eB}{m} \qquad (4.33)$$

This is known as the cyclotron frequency.
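The four equations (4.29) are easy to integrate numerically, which gives a quick check on the conserved quantities and on the cyclotron frequency. The sketch below uses a hand-written fourth-order Runge-Kutta step (our choice of integrator) and illustrative values for $e$, $B$ and $m$; none of these numbers come from the text.

```python
import math

# Illustrative values for charge, field strength and mass (assumptions).
E_CHARGE, B_FIELD, MASS = 1.0, 2.0, 0.5
OMEGA = E_CHARGE * B_FIELD / MASS            # cyclotron frequency (4.33)

def rhs(state):
    """Hamilton's equations (4.29); returns (x_dot, y_dot, px_dot, py_dot)."""
    x, y, px, py = state
    vx = (px + E_CHARGE * B_FIELD * y) / MASS
    return (vx, py / MASS, 0.0, -E_CHARGE * B_FIELD * vx)

def rk4_step(state, dt):
    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))
    k1 = rhs(state)
    k2 = rhs(shifted(state, k1, 0.5 * dt))
    k3 = rhs(shifted(state, k2, 0.5 * dt))
    k4 = rhs(shifted(state, k3, dt))
    return tuple(s + dt * (a + 2 * b + 2 * c + d) / 6
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def evolve(state, t_total, n_steps=2000):
    dt = t_total / n_steps
    for _ in range(n_steps):
        state = rk4_step(state, dt)
    return state

def constants(state):
    """The conserved combinations a = p_y + eBx (4.30) and b = p_x (4.31)."""
    x, y, px, py = state
    return (py + E_CHARGE * B_FIELD * x, px)

start = (0.3, -0.1, 0.2, 0.4)                # (x, y, p_x, p_y), arbitrary
period = 2 * math.pi / OMEGA
after_part = evolve(start, 0.37 * period)
after_one_period = evolve(start, period)
print(constants(start), constants(after_part))
print(after_one_period)
```

After a time $2\pi/\omega$ the state should return to where it started, and $a$ and $b$ should not change along the way.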


4.1.4 Some Conservation Laws 


In Section 2, we saw the importance of conservation laws in solving a given problem.
The conservation laws are often simple to see in the Hamiltonian formalism. For example,

Claim: If $\partial H/\partial t = 0$ (i.e. $H$ does not depend on time explicitly) then $H$ itself is
a constant of motion.

Proof:

$$\frac{dH}{dt} = \frac{\partial H}{\partial q_i}\dot q_i + \frac{\partial H}{\partial p_i}\dot p_i + \frac{\partial H}{\partial t} = -\dot p_i \dot q_i + \dot q_i \dot p_i + \frac{\partial H}{\partial t} = \frac{\partial H}{\partial t} \qquad (4.34)$$

□

Claim: If an ignorable coordinate $q$ doesn’t appear in the Lagrangian then, by construction,
it also doesn’t appear in the Hamiltonian. The conjugate momentum $p_q$ is
then conserved.

Proof:

$$\dot p_q = -\frac{\partial H}{\partial q} = 0 \qquad (4.35)$$

4.1.5 The Principle of Least Action


Recall that in section 2.1 we saw the principle of least action from the Lagrangian
perspective. This followed from defining the action

$$S = \int_{t_1}^{t_2} L(q_i, \dot q_i, t)\,dt \qquad (4.36)$$

Then we could derive Lagrange’s equations by insisting that $\delta S = 0$ for all paths with
fixed end points so that $\delta q_i(t_1) = \delta q_i(t_2) = 0$. How does this work in the Hamiltonian
formalism? It’s quite simple! We define the action

$$S = \int_{t_1}^{t_2} \left( p_i \dot q_i - H \right) dt \qquad (4.37)$$

where, of course, $\dot q_i = \dot q_i(q_i, p_i)$. Now we consider varying $q_i$ and $p_i$ independently. Notice
that this is different from the Lagrangian set-up, where a variation of $q_i$ automatically
leads to a variation of $\dot q_i$. But remember that the whole point of the Hamiltonian
formalism is that we treat $q_i$ and $p_i$ on equal footing. So we vary both. We have

$$\delta S = \int_{t_1}^{t_2} \left\{ \delta p_i\,\dot q_i + p_i\,\delta \dot q_i - \frac{\partial H}{\partial p_i}\delta p_i - \frac{\partial H}{\partial q_i}\delta q_i \right\} dt$$
$$\phantom{\delta S} = \int_{t_1}^{t_2} \left\{ \left[ \dot q_i - \frac{\partial H}{\partial p_i} \right] \delta p_i + \left[ -\dot p_i - \frac{\partial H}{\partial q_i} \right] \delta q_i \right\} dt + \left[ p_i\,\delta q_i \right]_{t_1}^{t_2} \qquad (4.38)$$

and there are Hamilton’s equations waiting for us in the square brackets. If we look
for extrema $\delta S = 0$ for all $\delta p_i$ and $\delta q_i$ we get Hamilton’s equations

$$\dot q_i = \frac{\partial H}{\partial p_i}, \qquad \dot p_i = -\frac{\partial H}{\partial q_i} \qquad (4.39)$$

Except there’s a very slight subtlety with the boundary conditions. We need the last
term in (4.38) to vanish, and so require only that

$$\delta q_i(t_1) = \delta q_i(t_2) = 0 \qquad (4.40)$$

while $\delta p_i$ can be free at the end points $t = t_1$ and $t = t_2$. So, despite our best efforts,
$q_i$ and $p_i$ are not quite symmetric in this formalism.

Note that we could simply impose $\delta p_i(t_1) = \delta p_i(t_2) = 0$ if we really wanted to and
the above derivation still holds. It would mean we were being more restrictive on the
types of paths we considered. But it does have the advantage that it keeps $q_i$ and $p_i$
on a symmetric footing. It also means that we have the freedom to add the total time
derivative of a function $F(q, p)$ and consider actions of the form

$$S = \int_{t_1}^{t_2} \left( p_i \dot q_i - H(q, p) + \frac{dF(q, p)}{dt} \right) dt \qquad (4.41)$$

so that what sits in the integrand differs from the Lagrangian. For some situations this
may be useful.


4.1.6 What’s Your Name, Man? William Rowan Hamilton (1805-1865) 


The formalism described above arose out of Hamilton’s interest in the theory of optics. 
The ideas were published in a series of books entitled “Theory of Systems of Rays”, the 
first of which appeared while Hamilton was still an undergraduate at Trinity College, 
Dublin. They also contain the first application of the Hamilton-Jacobi formulation 
(which we shall see in Section 4.7) and the first general statement of the principle of
least action, which sometimes goes by the name of “Hamilton’s Principle”.


Hamilton’s genius was recognised early. His capacity to soak up classical languages 
and to find errors in famous works of mathematics impressed many. In an unprece- 
dented move, he was offered a full professorship in Dublin while still an undergraduate. 
He also held the position of “Royal Astronomer of Ireland”, allowing him to live at 
Dunsink Observatory even though he rarely did any observing. Unfortunately, the 
later years of Hamilton’s life were not happy ones. The woman he loved married another
and he spent much time depressed, mired in drink, bad poetry and quaternions⁴.


4.2 Liouville’s Theorem 


We’ve succeeded in rewriting classical dynamics in terms of first order differential equa- 
tions in which each point in phase space follows a unique path under time evolution. 
We speak of a flow on phase space. In this section, we’ll look at some of the properties 
of these flows.


⁴ The “What’s your name, man?” joke is not mine. I stole it from the outrageously talented Tim
Blais, whose entertaining and scholarly account of the life of Hamilton can be seen on YouTube.

Figure 54: An infinitesimal volume element of phase space evolving in time.


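Liouville’s Theorem: Consider a region in phase space and watch it evolve over
time. Then the shape of the region will generically change, but Liouville’s theorem
states that the volume remains the same.

Proof: Let’s consider an infinitesimal volume moving for an infinitesimal time. We
start in a neighbourhood of the point $(q_i, p_i)$ in phase space, with volume

$$V = dq_1 \ldots dq_n\,dp_1 \ldots dp_n \qquad (4.42)$$

Then in time $dt$, we know that

$$q_i \to q_i + \dot q_i\,dt = q_i + \frac{\partial H}{\partial p_i}\,dt \equiv \tilde q_i \qquad (4.43)$$

and

$$p_i \to p_i + \dot p_i\,dt = p_i - \frac{\partial H}{\partial q_i}\,dt \equiv \tilde p_i \qquad (4.44)$$

So the new volume in phase space is

$$\tilde V = d\tilde q_1 \ldots d\tilde q_n\,d\tilde p_1 \ldots d\tilde p_n = (\det J)\,V \qquad (4.45)$$

where $\det J$ is the Jacobian of the transformation defined by the determinant of the
$2n \times 2n$ matrix

$$J = \begin{pmatrix} \partial \tilde q_i/\partial q_j & \partial \tilde q_i/\partial p_j \\ \partial \tilde p_i/\partial q_j & \partial \tilde p_i/\partial p_j \end{pmatrix} \qquad (4.46)$$

To prove the theorem, we need to show that $\det J = 1$. First consider a single degree
of freedom (i.e. $n = 1$). Then we have

$$\det J = \det \begin{pmatrix} 1 + (\partial^2 H/\partial p\,\partial q)\,dt & (\partial^2 H/\partial p^2)\,dt \\ -(\partial^2 H/\partial q^2)\,dt & 1 - (\partial^2 H/\partial q\,\partial p)\,dt \end{pmatrix} = 1 + O(dt^2) \qquad (4.47)$$

which means that

$$\frac{d(\det J)}{dt} = 0 \qquad (4.48)$$

so that the volume remains constant for all time. Now to generalise this to arbitrary
$n$, we have

$$\det J = \det \begin{pmatrix} \delta_{ij} + (\partial^2 H/\partial p_i\,\partial q_j)\,dt & (\partial^2 H/\partial p_i\,\partial p_j)\,dt \\ -(\partial^2 H/\partial q_i\,\partial q_j)\,dt & \delta_{ij} - (\partial^2 H/\partial q_i\,\partial p_j)\,dt \end{pmatrix} \qquad (4.49)$$

To compute the determinant, we need the result that $\det(1 + \epsilon M) = 1 + \epsilon\,\mathrm{Tr}\,M + O(\epsilon^2)$
for any matrix $M$ and small $\epsilon$. Then we have

$$\det J = 1 + \sum_i \left( \frac{\partial^2 H}{\partial p_i\,\partial q_i} - \frac{\partial^2 H}{\partial q_i\,\partial p_i} \right) dt + O(dt^2) = 1 + O(dt^2) \qquad (4.50)$$

and we’re done.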
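We can watch Liouville's theorem at work numerically. The sketch below evolves a pendulum, with an assumed Hamiltonian $H = p^2/2 - \cos q$ in units where all constants are one, for a finite time and estimates the Jacobian of the flow map by finite differences. For contrast it also switches on a friction term $-\gamma p$, an illustrative non-Hamiltonian modification of our own, for which the phase space volume shrinks as $e^{-\gamma t}$.

```python
import math

def flow(q, p, t, gamma=0.0, n_steps=2000):
    """Evolve (q, p) for a time t with RK4. gamma = 0 is the Hamiltonian pendulum
    q_dot = p, p_dot = -sin(q); gamma > 0 adds an assumed friction term -gamma*p."""
    def f(q, p):
        return p, -math.sin(q) - gamma * p
    dt = t / n_steps
    for _ in range(n_steps):
        k1 = f(q, p)
        k2 = f(q + 0.5 * dt * k1[0], p + 0.5 * dt * k1[1])
        k3 = f(q + 0.5 * dt * k2[0], p + 0.5 * dt * k2[1])
        k4 = f(q + dt * k3[0], p + dt * k3[1])
        q += dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6
        p += dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6
    return q, p

def jacobian_det(q, p, t, gamma=0.0, h=1e-5):
    """det of d(q(t), p(t)) / d(q(0), p(0)), estimated by central differences."""
    qa, pa = flow(q + h, p, t, gamma)
    qb, pb = flow(q - h, p, t, gamma)
    qc, pc = flow(q, p + h, t, gamma)
    qd, pd = flow(q, p - h, t, gamma)
    dq_dq, dp_dq = (qa - qb) / (2 * h), (pa - pb) / (2 * h)
    dq_dp, dp_dp = (qc - qd) / (2 * h), (pc - pd) / (2 * h)
    return dq_dq * dp_dp - dq_dp * dp_dq

det_hamiltonian = jacobian_det(1.0, 0.5, 3.0)
det_damped = jacobian_det(1.0, 0.5, 3.0, gamma=0.3)
expected_damped = math.exp(-0.3 * 3.0)
print(det_hamiltonian, det_damped, expected_damped)
```

The first determinant should come out equal to one to good accuracy even though the time is not infinitesimal, since a finite evolution is built from many infinitesimal steps. The second should not.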


4.2.1 Liouville’s Equation 


So how should we think about the volume of phase space? We could consider an
ensemble (or collection) of systems with some density function $\rho(p, q, t)$. We might
want to do this because

• We have a single system but don’t know the exact state very well. Then $\rho$ is
understood as a probability parameterising our ignorance and

$$\int \rho(q, p, t) \prod_i dp_i\,dq_i = 1 \qquad (4.51)$$

• We may have a large number $N$ of identical, non-interacting systems (e.g. $N = 10^{23}$
gas molecules in a jar) and we really only care about the averaged behaviour.
Then the distribution $\rho$ satisfies

$$\int \rho(q, p, t) \prod_i dq_i\,dp_i = N \qquad (4.52)$$

In the latter case, we know that particles in phase space (i.e. dynamical systems)
are neither created nor destroyed, so the number of particles in a given “comoving” volume
is conserved. Since Liouville tells us that the volume elements $dp\,dq$ are preserved,
we have $d\rho/dt = 0$. We write this as

$$\frac{d\rho}{dt} = \frac{\partial \rho}{\partial t} + \frac{\partial \rho}{\partial q_i}\frac{\partial H}{\partial p_i} - \frac{\partial \rho}{\partial p_i}\frac{\partial H}{\partial q_i} = 0 \qquad (4.53)$$

Rearranging the terms, we have,

$$\frac{\partial \rho}{\partial t} = \frac{\partial \rho}{\partial p_i}\frac{\partial H}{\partial q_i} - \frac{\partial \rho}{\partial q_i}\frac{\partial H}{\partial p_i} \qquad (4.54)$$

which is Liouville’s equation.

Notice that Liouville’s theorem holds whether or not the system conserves energy
(i.e. whether or not $\partial H/\partial t = 0$). But the system must be described by a Hamiltonian.
For example, systems with dissipation typically head to regions of phase space with
$\dot q_i = 0$ and so do not preserve phase space volume.


The central idea of Liouville’s theorem — that volume of phase space is constant — 
is somewhat reminiscent of quantum mechanics. Indeed, this is the first of several oc- 
casions where we shall see ideas of quantum physics creeping into the classical world. 
Suppose we have a system of particles distributed randomly within a square $\Delta q\,\Delta p$ in
phase space. Liouville’s theorem implies that if we evolve the system in any Hamiltonian
manner, we can cut down the spread of positions of the particles only at the
cost of increasing the spread of momentum. We’re reminded strongly of Heisenberg’s
uncertainty relation, which is also written $\Delta q\,\Delta p$ = constant.


While Liouville and Heisenberg seem to be talking the same language, there are very 
profound differences between them. The distribution in the classical picture reflects 
our ignorance of the system rather than any intrinsic uncertainty. This is perhaps best 
illustrated by the fact that we can evade Liouville’s theorem in a real system! The
crucial point is that a system of classical particles is really described by a collection of
points in phase space rather than a continuous distribution $\rho(q, p)$ as we modelled it
above. This means that if we’re clever we can evolve the system with a Hamiltonian
so that the points get closer together, while the spaces between the points get pushed
away. A method for achieving this is known as stochastic cooling and is an important
part of particle collider technology. In 1984 van der Meer won the Nobel prize for
pioneering this method.


4.2.2 Time Independent Distributions 


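Often in physics we’re interested in probability distributions that don’t change explicitly
in time (i.e. $\partial \rho/\partial t = 0$). There’s an important class of these of the form,

$$\rho = \rho(H(q, p)) \qquad (4.55)$$

To see that these are indeed time independent, look at

$$\frac{\partial \rho}{\partial t} = \frac{\partial \rho}{\partial p_i}\frac{\partial H}{\partial q_i} - \frac{\partial \rho}{\partial q_i}\frac{\partial H}{\partial p_i} = \frac{\partial \rho}{\partial H}\frac{\partial H}{\partial p_i}\frac{\partial H}{\partial q_i} - \frac{\partial \rho}{\partial H}\frac{\partial H}{\partial q_i}\frac{\partial H}{\partial p_i} = 0 \qquad (4.56)$$

A very famous example of this type is the Boltzmann distribution

$$\rho = \exp\left(-\frac{H(q, p)}{kT}\right) \qquad (4.57)$$

for systems at a temperature $T$. Here $k$ is the Boltzmann constant.

For example, for a free particle with $H = \mathbf p^2/2m$, the Boltzmann distribution is
$\rho = \exp(-m\dot{\mathbf r}^2/2kT)$ which is a Gaussian distribution in velocities.

An historically more interesting example comes from looking at a free particle in a
magnetic field, so $H = (\mathbf p - e\mathbf A)^2/2m$ (where we’ve set the speed of light $c = 1$ for
simplicity). Then the Boltzmann distribution is

$$\rho = \exp\left(-\frac{(\mathbf p - e\mathbf A)^2}{2mkT}\right) = \exp\left(-\frac{m\dot{\mathbf r}^2}{2kT}\right) \qquad (4.58)$$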


which is again a Gaussian distribution of velocities. In other words, the distribution 
in velocities is independent of the magnetic field. But this is odd: the magnetism of 
solids is all about how the motion of electrons is affected by magnetic fields. Yet we’ve 
seen that the magnetic field doesn’t affect the velocities of electrons. This is known as 
the Bohr-van Leeuwen paradox: there can be no magnetism in classical physics! This 
was one of the motivations for the development of quantum theory. 


4.2.3 Poincaré Recurrence Theorem 


We now turn to work of Poincaré from around 1890. The following theorem applies to 
systems with a bounded phase space (i.e. of finite volume). This is not an uncommon 
occurrence. For example, if we have a conserved energy E = T +V with T > 0 and 
V > 0 then the accessible phase space is bounded by the spatial region V(r) < E. 
With this in mind, we have 


Theorem: Consider an initial point $P$ in phase space. Then for any neighbourhood
$D_0$ of $P$, there exists a point $P' \in D_0$ that will return to $D_0$ in a finite time.

Proof: Consider the evolution of $D_0$ over a finite time interval $T$. Hamilton’s
equations provide a map $D_0 \mapsto D_1$ shown in figure 55. By Liouville’s theorem, we
know that $\mathrm{Vol}(D_0) = \mathrm{Vol}(D_1)$, although the shapes of these two regions will in general
be different. Let $D_k$ be the region after time $kT$ where $k$ is an integer. Then there
must exist integers $k$ and $k'$ such that the intersection of $D_k$ and $D_{k'}$ is not empty:

$$D_k \cap D_{k'} \neq \emptyset \qquad (4.59)$$

Figure 55: The Hamiltonian map in a time step T.

(If this isn’t true then the total volume $\bigcup_{k=0}^{\infty} D_k \to \infty$ but, by assumption, the phase
space volume is finite). Take $k' > k$ such that $\omega_{k,k'} = D_k \cap D_{k'} \neq \emptyset$. But since
the Hamiltonian mapping $D_k \to D_{k+1}$ is invertible, we can track backwards to find
$\omega_{0,k'-k} = D_0 \cap D_{k'-k} \neq \emptyset$.

Figure 56: Figure 57:

So some point $P' \in D_0$ has returned to $D_0$ in $k' - k$ time steps $T$.

What does the Poincaré recurrence theorem mean? Consider gas molecules all in
one corner of the room. If we let them go, they fill the room. But this theorem tells us
that if we wait long enough, they will all return once more to the corner of the room.
The trick is that the Poincaré recurrence time for this to happen can easily be longer
than the lifetime of the universe!


Figure 58: Eventually all the air molecules in a room will return to one corner. 


Question: Where’s your second law of thermodynamics now?!

4.3 Poisson Brackets


In this section, we'll present a rather formal, algebraic description of classical dynamics 
which makes it look almost identical to quantum mechanics! We’ll return to this 
analogy later in the course. 


We start with a definition. Let $f(q, p)$ and $g(q, p)$ be two functions on phase space.
Then the Poisson bracket is defined to be

$$\{f, g\} = \frac{\partial f}{\partial q_i}\frac{\partial g}{\partial p_i} - \frac{\partial f}{\partial p_i}\frac{\partial g}{\partial q_i} \qquad (4.60)$$

Since this is a kind of weird definition, let’s look at some of the properties of the Poisson
bracket to get a feel for it. We have

• $\{f, g\} = -\{g, f\}$.

• linearity: $\{\alpha f + \beta g, h\} = \alpha\{f, h\} + \beta\{g, h\}$ for all $\alpha, \beta \in \mathbf R$.

• Leibniz rule: $\{fg, h\} = f\{g, h\} + \{f, h\}g$ which follows from the product rule in
differentiation.

• Jacobi identity: $\{f, \{g, h\}\} + \{g, \{h, f\}\} + \{h, \{f, g\}\} = 0$. To prove this you
need a large piece of paper and a hot cup of coffee. Expand out all 24 terms and
watch them cancel one by one.

What we've seen above is that the Poisson bracket $\{\,,\}$ satisfies the same algebraic
structure as matrix commutators $[\,,]$ and the differentiation operator $d$. This is related
to Heisenberg’s and Schrödinger’s viewpoints of quantum mechanics respectively. (You
may be confused about what the Jacobi identity means for the derivative operator $d$.
Strictly speaking, the Poisson bracket is like a “Lie derivative” found in differential
geometry, for which there is a corresponding Jacobi identity).


The relationship to quantum mechanics is emphasised even more if we calculate

$$\{q_i, q_j\} = 0, \qquad \{p_i, p_j\} = 0, \qquad \{q_i, p_j\} = \delta_{ij} \qquad (4.61)$$

We'll return to this in section 4.8.

Claim: For any function $f(q, p, t)$,

$$\frac{df}{dt} = \{f, H\} + \frac{\partial f}{\partial t} \qquad (4.62)$$

Proof:

$$\frac{df}{dt} = \frac{\partial f}{\partial p_i}\dot p_i + \frac{\partial f}{\partial q_i}\dot q_i + \frac{\partial f}{\partial t} = -\frac{\partial f}{\partial p_i}\frac{\partial H}{\partial q_i} + \frac{\partial f}{\partial q_i}\frac{\partial H}{\partial p_i} + \frac{\partial f}{\partial t} = \{f, H\} + \frac{\partial f}{\partial t} \qquad (4.63)$$

Isn’t this a lovely equation! One consequence is that if we can find a function $I(p, q)$
which satisfies

$$\{I, H\} = 0 \qquad (4.64)$$

then $I$ is a constant of motion. We say that $I$ and $H$ Poisson commute. As an example
of this, suppose that $q_i$ is ignorable (i.e. it does not appear in $H$) then

$$\{p_i, H\} = 0 \qquad (4.65)$$

which is the way to see the relationship between ignorable coordinates and conserved
quantities in the Poisson bracket language.

Note that if $I$ and $J$ are constants of motion then $\{\{I, J\}, H\} = \{I, \{J, H\}\} + \{\{I, H\}, J\} = 0$
which means that $\{I, J\}$ is also a constant of motion. We say that the
constants of motion form a closed algebra under the Poisson bracket.
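If you'd rather not reach for the large piece of paper, the properties above can at least be spot-checked numerically. The sketch below implements the bracket (4.60) with central finite differences, which is our own choice and happens to be exact for quadratic functions up to rounding. The sample functions and the sample point are arbitrary. It checks the canonical brackets (4.61), the Jacobi identity, and one instance of (4.64) for an assumed two-dimensional harmonic oscillator.

```python
def poisson(f, g, h=1e-3):
    """Return the function {f, g}; f and g take (q, p), two equal-length lists."""
    def bracket(q, p):
        def d(fun, wrt_q, i):
            up_q, up_p, dn_q, dn_p = list(q), list(p), list(q), list(p)
            if wrt_q:
                up_q[i] += h
                dn_q[i] -= h
            else:
                up_p[i] += h
                dn_p[i] -= h
            return (fun(up_q, up_p) - fun(dn_q, dn_p)) / (2 * h)
        return sum(d(f, True, i) * d(g, False, i) - d(f, False, i) * d(g, True, i)
                   for i in range(len(q)))
    return bracket

# Arbitrary quadratic test functions on a 4-dimensional phase space.
def f1(q, p): return q[0] * p[1] + q[1] ** 2
def f2(q, p): return p[0] * p[1] - q[0] * q[1]
def f3(q, p): return q[0] ** 2 + p[0] * q[1] + p[1] ** 2

# Assumed example: an isotropic oscillator and its angular momentum.
def H(q, p): return 0.5 * (p[0] ** 2 + p[1] ** 2 + q[0] ** 2 + q[1] ** 2)
def I(q, p): return q[0] * p[1] - q[1] * p[0]

q0, p0 = [0.3, -1.2], [0.7, 0.5]

canonical = [[poisson(lambda q, p, i=i: q[i], lambda q, p, j=j: p[j])(q0, p0)
              for j in range(2)] for i in range(2)]
jacobi = (poisson(f1, poisson(f2, f3))(q0, p0)
          + poisson(f2, poisson(f3, f1))(q0, p0)
          + poisson(f3, poisson(f1, f2))(q0, p0))
commutes = poisson(I, H)(q0, p0)
print(canonical, jacobi, commutes)
```

Because `poisson` returns a function, brackets can be nested, which is what the Jacobi check relies on.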


4.3.1 An Example: Angular Momentum and Runge-Lenz

Consider the angular momentum $\mathbf L = \mathbf r \times \mathbf p$ which, in component form, reads

$$L_1 = r_2 p_3 - r_3 p_2, \qquad L_2 = r_3 p_1 - r_1 p_3, \qquad L_3 = r_1 p_2 - r_2 p_1 \qquad (4.66)$$

and let’s look at the Poisson bracket structure. We have

$$\{L_1, L_2\} = \{r_2 p_3 - r_3 p_2,\; r_3 p_1 - r_1 p_3\} = \{r_2 p_3, r_3 p_1\} + \{r_3 p_2, r_1 p_3\} = -r_2 p_1 + p_2 r_1 = L_3 \qquad (4.67)$$

So if $L_1$ and $L_2$ are conserved, we see that $L_3$ must also be conserved. Or, in other
words, the whole vector $\mathbf L$ is conserved if any two components are. Similarly, one can
show that

$$\{L^2, L_i\} = 0 \qquad (4.68)$$

where $L^2 = \sum_i L_i^2$. This should all be looking familiar from quantum mechanics.

Another interesting object is the (Hermann-Bernoulli-Laplace-Pauli-) Runge-Lenz
vector, defined as

$$\mathbf A = \frac{1}{m}\,\mathbf p \times \mathbf L - \hat{\mathbf r} \qquad (4.69)$$

where $\hat{\mathbf r} = \mathbf r/r$. This vector satisfies $\mathbf A \cdot \mathbf L = 0$. If you’re willing to spend some time
playing with indices, it’s not hard to derive the following expressions for the Poisson
bracket structure

$$\{L_a, A_b\} = \epsilon_{abc} A_c, \qquad \{A_a, A_b\} = -\frac{2}{m}\left(\frac{p^2}{2m} - \frac{1}{r}\right)\epsilon_{abc} L_c \qquad (4.70)$$

The last of these equations suggests something special might happen when we consider
the familiar Hamiltonian $H = \mathbf p^2/2m - 1/r$ so that the Poisson bracket becomes

$$\{A_a, A_b\} = -\frac{2H}{m}\,\epsilon_{abc} L_c \qquad (4.71)$$

Indeed, for this choice of Hamiltonian it is rather simple to show that

$$\{H, \mathbf A\} = 0 \qquad (4.72)$$

So we learn that the Hamiltonian with $-1/r$ potential has another constant of motion $\mathbf A$
that we’d previously missed! The fact that $\mathbf A$ is conserved can be used to immediately
derive Kepler’s elliptical orbits: dotting $\mathbf A$ with $\hat{\mathbf r}$ yields $\hat{\mathbf r} \cdot \mathbf A + 1 = L^2/mr$ which is the
equation for an ellipse. Note that the three constants of motion, $\mathbf L$, $\mathbf A$ and $H$ form a
closed algebra under the Poisson bracket.

Noether’s theorem tells us that the conservation of $\mathbf L$ and $H$ are related to rotational
symmetry and time translation respectively. One might wonder whether there’s a
similar symmetry responsible for the conservation of $\mathbf A$. It turns out that there is: the
Hamiltonian has a hidden SO(4) symmetry group. You can read more about this in
Goldstein.
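For those unwilling to spend time playing with indices, the bracket relations in this section can be spot-checked at a single point of phase space. The sketch below evaluates Poisson brackets with central finite differences (so the results are approximate, with errors far below the tolerance we care about here). The mass and the sample point are arbitrary choices of ours.

```python
import math

MASS = 1.3  # illustrative value (assumption); the potential is -1/r as in the text

def cross(u, v):
    return [u[1]*v[2] - u[2]*v[1], u[2]*v[0] - u[0]*v[2], u[0]*v[1] - u[1]*v[0]]

def norm(u):
    return math.sqrt(sum(c * c for c in u))

def poisson(f, g, r, p, h=1e-5):
    """{f, g} evaluated at (r, p), using central differences for the derivatives."""
    def d(fun, wrt_r, i):
        up_r, up_p, dn_r, dn_p = list(r), list(p), list(r), list(p)
        if wrt_r:
            up_r[i] += h
            dn_r[i] -= h
        else:
            up_p[i] += h
            dn_p[i] -= h
        return (fun(up_r, up_p) - fun(dn_r, dn_p)) / (2 * h)
    return sum(d(f, True, i) * d(g, False, i) - d(f, False, i) * d(g, True, i)
               for i in range(3))

def ang_mom(r, p):
    return cross(r, p)

def L_squared(r, p):
    return sum(c * c for c in ang_mom(r, p))

def hamiltonian(r, p):
    return sum(c * c for c in p) / (2 * MASS) - 1.0 / norm(r)

def runge_lenz(r, p):
    """A = (1/m) p x L - r_hat, as in (4.69)."""
    pxL = cross(p, cross(r, p))
    rn = norm(r)
    return [pxL[k] / MASS - r[k] / rn for k in range(3)]

def comp(vec_fun, k):
    """Pick out the k-th component of a vector-valued function of (r, p)."""
    return lambda r, p: vec_fun(r, p)[k]

r0, p0 = [0.8, -0.3, 0.5], [0.2, 0.9, -0.4]
L0, A0, H0 = ang_mom(r0, p0), runge_lenz(r0, p0), hamiltonian(r0, p0)

# Each entry should vanish if the corresponding relation holds.
checks = {
    "{L1,L2} - L3": poisson(comp(ang_mom, 0), comp(ang_mom, 1), r0, p0) - L0[2],
    "{L^2,L1}": poisson(L_squared, comp(ang_mom, 0), r0, p0),
    "{L1,A2} - A3": poisson(comp(ang_mom, 0), comp(runge_lenz, 1), r0, p0) - A0[2],
    "{A1,A2} + (2H/m) L3": poisson(comp(runge_lenz, 0), comp(runge_lenz, 1), r0, p0)
                           + 2 * H0 / MASS * L0[2],
    "{H,A1}": poisson(hamiltonian, comp(runge_lenz, 0), r0, p0),
    "{H,A2}": poisson(hamiltonian, comp(runge_lenz, 1), r0, p0),
    "{H,A3}": poisson(hamiltonian, comp(runge_lenz, 2), r0, p0),
    "rhat.A + 1 - L^2/(m r)": sum(r0[k] * A0[k] for k in range(3)) / norm(r0) + 1
                              - L_squared(r0, p0) / (MASS * norm(r0)),
}
for name, value in checks.items():
    print(name, value)
```

A check at one point is of course no substitute for the proof, but it is a cheap way to catch sign errors.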

4.3.2 An Example: Magnetic Monopoles 


We’ve seen in the example of section 4.1.3 that a particle in a magnetic field B= Vx A 
is described by the Hamiltonian 


Te > E (4.73) 


where, as usual in the Hamiltonian, r is to be thought of as a function of r and p. It’s 
a simple matter to compute the Poisson bracket structure for this system: it reads 


{Mra Mro} = € EabcBe ; {Mra rp} = —dab (4.74) 
Let’s now use this to describe a postulated object known as a magnetic monopole. It’s
a fact that all magnets ever discovered are dipoles: they have both a north and south
pole. Chop the magnet in two, and each piece also has a north and a south pole.
Indeed, this fact is woven into the very heart of electromagnetism when formulated in
terms of the gauge potential A. Since we define $\mathbf{B} = \nabla\times\mathbf{A}$, we immediately have one
of Maxwell’s equations,


$$\nabla\cdot\mathbf{B} = 0 \tag{4.75}$$


which states that any flux that enters a region must also leave. Or, in other words,
there can be no magnetic monopole. Such a monopole would have a radial magnetic
field,

$$\mathbf{B} = g\,\frac{\mathbf{r}}{r^3} \tag{4.76}$$

which doesn’t satisfy (4.75) since it gives rise to a delta function on the right-hand side.
So if magnetic monopoles have never been observed, and are forbidden by Maxwell’s
equations, why are we interested in them?! The point is that every theory that goes
beyond Maxwell’s equations and tries to unify electromagnetism with the other forces
of Nature predicts magnetic monopoles. So there’s reason to suspect that, somewhere
in the universe, there may be particles with a radial magnetic field given by (4.76)⁵.


What happens if an electron moves in the background of a monopole? It’s tricky 
to set up the Lagrangian as we don’t have a gauge potential A. (Actually, one can 
work with certain singular gauge potentials but we won’t go there). However, we can 
play with the Poisson brackets (4.74) which contain only the magnetic field. As an 
application, consider the generalised angular momentum, 


$$\mathbf{J} = m\,\mathbf{r}\times\dot{\mathbf{r}} - eg\,\hat{\mathbf{r}} \tag{4.77}$$


where $\hat{\mathbf{r}} = \mathbf{r}/r$. For g = 0, this expression reduces to the usual angular momentum. It is
a simple matter to show using (4.74) that in the background of the magnetic monopole
the Hamiltonian $H = \frac{1}{2}m\dot{\mathbf{r}}^2$ and J satisfy


$$\{H, \mathbf{J}\} = 0 \tag{4.78}$$


which guarantees that J is a constant of motion. What do we learn from this? Since
J is conserved, we can look at $\hat{\mathbf{r}}\cdot\mathbf{J} = -eg$ to learn that the motion of an electron in
the background of a magnetic monopole lies on a cone of angle $\cos\theta = eg/J$ pointing
away from the vector J.
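The cone can be seen directly in a simulation. The sketch below assumes the Lorentz-force equation of motion m r̈ = e ṙ × B that follows from the brackets (4.74), with the monopole field B = g r/r³. The integrator, parameter values and initial conditions are illustrative choices only.

```python
import math

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def dot(a, b):
    return sum(x*y for x, y in zip(a, b))

def rhs(state, m=1.0, e=1.0, g=1.0):
    """state = (r, v); m dv/dt = e v x B with B = g r / r^3."""
    r, v = state[:3], state[3:]
    r3 = dot(r, r)**1.5
    B = tuple(g*ri/r3 for ri in r)
    F = cross(v, B)
    return tuple(v) + tuple(e*Fi/m for Fi in F)

def evolve(state, t_end, dt=1e-3):
    shift = lambda s, k, h: tuple(si + h*ki for si, ki in zip(s, k))
    for _ in range(int(round(t_end/dt))):
        k1 = rhs(state)
        k2 = rhs(shift(state, k1, dt/2))
        k3 = rhs(shift(state, k2, dt/2))
        k4 = rhs(shift(state, k3, dt))
        state = tuple(s + dt/6*(a + 2*b + 2*c + d)
                      for s, a, b, c, d in zip(state, k1, k2, k3, k4))
    return state

def J_vector(state, m=1.0, e=1.0, g=1.0):
    """Generalised angular momentum J = m r x v - e g r_hat."""
    r, v = state[:3], state[3:]
    rn = math.sqrt(dot(r, r))
    L = cross(r, v)
    return tuple(m*L[i] - e*g*r[i]/rn for i in range(3))

def rhat_dot_J(state):
    r = state[:3]
    return dot(r, J_vector(state))/math.sqrt(dot(r, r))

start = (1.0, 0.0, 0.0, 0.0, 1.0, 0.3)
end = evolve(start, 5.0)
print(J_vector(start), J_vector(end), rhat_dot_J(end))
```

With e = g = 1 the last printed number should be −1 at every point of the trajectory, which is the statement that the particle stays on the cone.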


⁵You can read more about magnetic monopoles in the lectures on Electromagnetism, the lectures
on Solid State Physics and the lectures on Gauge Theory. See, they really do get everywhere! 
4.3.3 An Example: The Motion of Vortices


The formal structure of Poisson brackets that we’ve introduced here can be employed
even when it’s not obvious that we’re talking about coordinates and momenta. To
illustrate this, consider the rather odd motion of line vortices moving in a plane. For n
vortices with positions $\mathbf{r}_i = (x_i, y_i)$, each with strength $\gamma_i$, the equations of motion are


$$\gamma_i\,\dot x_i = -\sum_{j\neq i}\gamma_i\gamma_j\,\frac{y_i - y_j}{|\mathbf{r}_i - \mathbf{r}_j|^2}$$
$$\gamma_i\,\dot y_i = +\sum_{j\neq i}\gamma_i\gamma_j\,\frac{x_i - x_j}{|\mathbf{r}_i - \mathbf{r}_j|^2} \tag{4.79}$$

where there is no sum over i on the left hand side of
these equations. Notice that these are first order equations
for the position variables, rather than for position
and momentum. How can we cast this dynamics in a
Hamiltonian framework? The trick is to consider one of
the positions as a “canonical momentum”. We consider
the Hamiltonian


$$H = -\sum_{i<j}\gamma_i\gamma_j\,\log|\mathbf{r}_i - \mathbf{r}_j| \tag{4.80}$$


Figure 59:


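which is endowed with the Poisson bracket structure


$$\{f, g\} = \sum_{i=1}^{n}\frac{1}{\gamma_i}\left(\frac{\partial f}{\partial x_i}\frac{\partial g}{\partial y_i} - \frac{\partial f}{\partial y_i}\frac{\partial g}{\partial x_i}\right) \tag{4.81}$$


In particular, we have


$$\{x_i, y_j\} = \frac{\delta_{ij}}{\gamma_i} \tag{4.82}$$

Using this, we can check that Hamilton’s equations reproduce the equations of motion,

$$\dot x_i = \{x_i, H\} = \frac{1}{\gamma_i}\frac{\partial H}{\partial y_i}$$
$$\dot y_i = \{y_i, H\} = -\frac{1}{\gamma_i}\frac{\partial H}{\partial x_i} \tag{4.83}$$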


The system has a number of conserved quantities. Firstly, there is the “total momentum”,

$$P_x = \sum_i \gamma_i y_i \quad\text{and}\quad P_y = -\sum_i \gamma_i x_i \tag{4.84}$$

which satisfy $\{P_x, H\} = \{P_y, H\} = 0$, ensuring that they are conserved quantities.
We also have $\{P_x, P_y\} = \sum_i \gamma_i$ and the right hand side, being constant, is trivially
conserved.


The other conserved quantity is the “total angular momentum”,

$$J = -\frac{1}{2}\sum_{i=1}^{n}\gamma_i\,(x_i^2 + y_i^2) \tag{4.85}$$


which again satisfies {J, H} = 0, ensuring it is conserved. The full algebra of the
conserved quantities includes $\{P_x, J\} = -P_y$ and $\{P_y, J\} = P_x$, so the system closes
(meaning we get back something we know on the right hand side). In fact, one can
show that H, J and $(P_x^2 + P_y^2)$ provide three mutually Poisson commuting conserved
quantities.


So what is the resulting motion of a bunch of vortices? For two vortices, we can
simply solve the equations of motion to find,


$$x_1 - x_2 = -R\sin\left(\omega(t - t_0)\right) \quad\text{and}\quad y_1 - y_2 = R\cos\left(\omega(t - t_0)\right) \tag{4.86}$$


where R is the separation between the vortices and $\omega = (\gamma_1 + \gamma_2)/R^2$. So we learn
that two vortices orbit each other with frequency inversely proportional to the square
of their separation.
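A short simulation makes the two-vortex orbit concrete. The sketch below integrates the first-order equations of motion and evaluates the conserved quantities. It assumes the total momentum is P_x = Σ γ_i y_i, P_y = −Σ γ_i x_i and the angular momentum is J = −½ Σ γ_i (x_i² + y_i²); the Runge-Kutta integrator, the strengths and the starting positions are illustrative choices.

```python
import math

def velocities(pos, gam):
    """(dx_i/dt, dy_i/dt) from the vortex equations of motion."""
    vel = []
    for i, (xi, yi) in enumerate(pos):
        vx = vy = 0.0
        for j, (xj, yj) in enumerate(pos):
            if j == i:
                continue
            d2 = (xi - xj)**2 + (yi - yj)**2
            vx -= gam[j]*(yi - yj)/d2
            vy += gam[j]*(xi - xj)/d2
        vel.append((vx, vy))
    return vel

def rk4(pos, gam, dt):
    def shift(p, k, h):
        return [(x + h*kx, y + h*ky) for (x, y), (kx, ky) in zip(p, k)]
    k1 = velocities(pos, gam)
    k2 = velocities(shift(pos, k1, dt/2), gam)
    k3 = velocities(shift(pos, k2, dt/2), gam)
    k4 = velocities(shift(pos, k3, dt), gam)
    return [(x + dt/6*(a[0] + 2*b[0] + 2*c[0] + d[0]),
             y + dt/6*(a[1] + 2*b[1] + 2*c[1] + d[1]))
            for (x, y), a, b, c, d in zip(pos, k1, k2, k3, k4)]

def evolve(pos, gam, t_end, dt=1e-3):
    for _ in range(int(round(t_end/dt))):
        pos = rk4(pos, gam, dt)
    return pos

def conserved(pos, gam):
    """Return (P_x, P_y, J, H) for a vortex configuration."""
    Px = sum(g*y for g, (x, y) in zip(gam, pos))
    Py = -sum(g*x for g, (x, y) in zip(gam, pos))
    J = -0.5*sum(g*(x*x + y*y) for g, (x, y) in zip(gam, pos))
    H = 0.0
    for i in range(len(pos)):
        for j in range(i + 1, len(pos)):
            d = math.hypot(pos[i][0] - pos[j][0], pos[i][1] - pos[j][1])
            H -= gam[i]*gam[j]*math.log(d)
    return Px, Py, J, H

gam = (1.0, 2.0)
pair = evolve([(0.0, 0.5), (0.0, -0.5)], gam, 1.0)   # separation R = 1
omega = (gam[0] + gam[1])/1.0**2
print(pair[0][1] - pair[1][1], math.cos(omega*1.0))
```

The two printed numbers should agree: the relative vector of the pair rotates rigidly at angular frequency ω while its length stays fixed.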


For three vortices, it turns out that there is a known solution which is possible 
because of the three mutually Poisson commuting conserved quantities we saw above. 
We say the system is “integrable”. We’ll define this properly shortly. For four or more 
vortices, the motion is chaotic⁶.


You may think that the Poisson bracket structure {x, y} ≠ 0 looks a little strange.
But it also appears in a more familiar setting: a charged particle moving in a magnetic
field B = (0, 0, B). We saw this example in section 4.1.3, where we calculated


$$p_x = m\dot x - \frac{eB}{c}\,y \tag{4.87}$$


For large magnetic fields the second term in this equation dominates, and we have
$p_x \approx -eBy/c$. In this case the Poisson bracket is


$$\{x, p_x\} = 1 \quad\Rightarrow\quad \{x, y\} \approx -\frac{c}{eB} \tag{4.88}$$


⁶For more details on this system, see the review H. Aref, “Integrable, Chaotic, and Turbulent Vortex
Motion in Two-Dimensional Flows”, Ann. Rev. Fluid Mech. 15 345 (1983). 
This algebraic similarity between vortices and electrons is an active area of current
research: can we make vortices do similar things to electrons in magnetic fields? For
example: will vortices in a Bose-Einstein condensate form a fractional quantum Hall
state?


4.4 Canonical Transformations 


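There is a way to write Hamilton’s equations so that they look even more symmetric.
Define the 2n vector $\mathbf{x} = (q_1, \ldots, q_n, p_1, \ldots, p_n)^T$ and the 2n × 2n matrix J,


$$J = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \tag{4.89}$$


where each entry is itself an n × n matrix. Then with this notation, Hamilton’s equations
read


$$\dot{\mathbf{x}} = J\,\frac{\partial H}{\partial\mathbf{x}} \tag{4.90}$$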


Now remember that in the Lagrangian formalism we made a big deal about the fact that
we could change coordinates $q_i \to Q_i(q)$ without changing the form of the equations.
Since we’ve managed to put $q_i$ and $p_i$ on an equal footing in the Hamiltonian formalism,
one might wonder if it’s possible to make an even larger class of transformations of the
form,


$$q_i \to Q_i(q, p) \quad\text{and}\quad p_i \to P_i(q, p) \tag{4.91}$$


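The answer is yes! But not all such transformations are allowed. To see what class of
transformations leaves Hamilton’s equations invariant, we use our new symmetric form
in terms of x and write the transformation as


$$x_i \to y_i(\mathbf{x}) \tag{4.92}$$

Note that we’ll continue to use the index i which now runs over the range i = 1, ..., 2n.
We have

$$\dot y_i = \frac{\partial y_i}{\partial x_j}\,\dot x_j = \frac{\partial y_i}{\partial x_j}\,J_{jk}\,\frac{\partial H}{\partial x_k} = \frac{\partial y_i}{\partial x_j}\,J_{jk}\,\frac{\partial H}{\partial y_l}\frac{\partial y_l}{\partial x_k} \tag{4.93}$$

or, collating all the indices, we have

$$\dot{\mathbf{y}} = (\mathcal{J}J\mathcal{J}^T)\,\frac{\partial H}{\partial\mathbf{y}} \tag{4.94}$$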
where $\mathcal{J}_{ij} = \partial y_i/\partial x_j$ is the Jacobian that we met in section 4.2. We see that Hamilton’s
equations are left invariant under any transformation whose Jacobian $\mathcal{J}$ satisfies


$$\mathcal{J}J\mathcal{J}^T = J \quad\Rightarrow\quad \frac{\partial y_i}{\partial x_j}\,J_{jk}\,\frac{\partial y_l}{\partial x_k} = J_{il} \tag{4.95}$$


The Jacobian $\mathcal{J}$ is said to be symplectic if this holds. A change of variables with a
symplectic Jacobian is said to be a canonical transformation.


There is a nice method to construct canonical transformations using “generating 
functions” which we will mention in section 4.4.3. Before we get to this, let’s look 
at some uses. We start by proving a theorem relating canonical transformations with 
Poisson brackets. 


Theorem: The Poisson bracket is invariant under canonical transformations. Conversely,
any transformation which preserves the Poisson bracket structure so that


$$\{Q_i, Q_j\} = \{P_i, P_j\} = 0 \quad\text{and}\quad \{Q_i, P_j\} = \delta_{ij} \tag{4.96}$$


is canonical.


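Proof: Let’s start by showing that the Poisson bracket is invariant under canonical
transformations. Consider two functions $f(x_i)$ and $g(x_i)$. Then,


$$\{f, g\} = \frac{\partial f}{\partial q_i}\frac{\partial g}{\partial p_i} - \frac{\partial f}{\partial p_i}\frac{\partial g}{\partial q_i} = \frac{\partial f}{\partial x_i}\,J_{ij}\,\frac{\partial g}{\partial x_j} \tag{4.97}$$

So if $\mathbf{x} \to \mathbf{y}(\mathbf{x})$, we have

$$\frac{\partial f}{\partial x_i} = \frac{\partial f}{\partial y_k}\,\mathcal{J}_{ki} \tag{4.98}$$

and, assuming the transformation is canonical, the Poisson bracket becomes

$$\{f, g\} = \frac{\partial f}{\partial y_k}\,\mathcal{J}_{ki}\,J_{ij}\,\mathcal{J}_{lj}\,\frac{\partial g}{\partial y_l} = \frac{\partial f}{\partial y_k}\,J_{kl}\,\frac{\partial g}{\partial y_l} \tag{4.99}$$


This means that we can compute our Poisson brackets in any coordinates related by a
canonical transformation. Now let’s show the converse. Go back to the notation $(q_i, p_i)$
and the new coordinates $(Q_i(q, p), P_i(q, p))$. The Jacobian is given by


$$\mathcal{J}_{ij} = \begin{pmatrix} \partial Q_i/\partial q_j & \partial Q_i/\partial p_j \\ \partial P_i/\partial q_j & \partial P_i/\partial p_j \end{pmatrix} \tag{4.100}$$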
If we now compute $\mathcal{J}J\mathcal{J}^T$ in components, we get


$$(\mathcal{J}J\mathcal{J}^T)_{ij} = \begin{pmatrix} \{Q_i, Q_j\} & \{Q_i, P_j\} \\ \{P_i, Q_j\} & \{P_i, P_j\} \end{pmatrix} \tag{4.101}$$


So whenever the Poisson bracket structure is preserved, the transformation is canonical.


Example 


In the next section we’ll see several non-trivial examples of canonical transformations 
which mix up q and p variables. But for now let’s content ourselves with reproducing 
the coordinate changes that we had in section 2. Consider a change of coordinates of 
the form 


$$q_i \to Q_i(q) \tag{4.102}$$


We know that Lagrange’s equations are invariant under this. But what transformation
do we have to make on the momenta


$$p_i \to P_i(q, p) \tag{4.103}$$

so that Hamilton’s equations are also invariant? We write $\Theta_{ij} = \partial Q_i/\partial q_j$ and look at
the Jacobian


$$\mathcal{J}_{ij} = \begin{pmatrix} \Theta_{ij} & 0 \\ \partial P_i/\partial q_j & \partial P_i/\partial p_j \end{pmatrix} \tag{4.104}$$


In order for the transformation to be canonical, we require $\mathcal{J}J\mathcal{J}^T = J$. By expanding
these matrices out in components, we see that this is true if


$$P_i = (\Theta^{-1})_{ji}\,p_j \tag{4.105}$$


This is as we would expect, for it’s equivalent to $P_i = \partial L/\partial\dot Q_i$. Note that although
$Q_i = Q_i(q)$ only, $P_i \neq P_i(p)$. Instead, the new momentum $P_i$ depends on both q and p.


4.4.1 Infinitesimal Canonical Transformations 


Consider transformations of the form 


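$$q_i \to Q_i = q_i + \alpha F_i(q, p)$$
$$p_i \to P_i = p_i + \alpha E_i(q, p) \tag{4.106}$$

where α is considered to be infinitesimally small. What functions $F_i(q, p)$ and $E_i(q, p)$
are allowed for this to be a canonical transformation? The Jacobian is


$$\mathcal{J} = \begin{pmatrix} \delta_{ij} + \alpha\,\partial F_i/\partial q_j & \alpha\,\partial F_i/\partial p_j \\ \alpha\,\partial E_i/\partial q_j & \delta_{ij} + \alpha\,\partial E_i/\partial p_j \end{pmatrix} \tag{4.107}$$

so the requirement that $\mathcal{J}J\mathcal{J}^T = J$ gives us, to first order in α,

$$\frac{\partial F_i}{\partial q_j} = -\frac{\partial E_j}{\partial p_i} \tag{4.108}$$

(together with the requirement that $\partial F_i/\partial p_j$ and $\partial E_i/\partial q_j$ are symmetric in i and j),
which is true if

$$F_i = \frac{\partial G}{\partial p_i} \quad\text{and}\quad E_i = -\frac{\partial G}{\partial q_i} \tag{4.109}$$

for some function G(q, p). We say that G generates the transformation.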


This discussion motivates a slightly different way of thinking about canonical transformations.
Suppose that we have a one-parameter family of transformations,


$$q_i \to Q_i(q, p; \alpha) \quad\text{and}\quad p_i \to P_i(q, p; \alpha) \tag{4.110}$$


which are canonical for all $\alpha \in \mathbb{R}$ and have the property that $Q_i(q, p; \alpha = 0) = q_i$ and
$P_i(q, p; \alpha = 0) = p_i$. Up until now, we’ve been thinking of canonical transformations in
the “passive” sense, with the $(Q_i, P_i)$ labelling the same point in phase space as $(q_i, p_i)$,
just in different coordinates. But a one-parameter family of canonical transformations
can be endowed with a different interpretation, namely that the transformations take
us from one point in the phase space $(q_i, p_i)$ to another point in the same phase space
$(Q_i(q, p; \alpha), P_i(q, p; \alpha))$. In this “active” interpretation, as we vary the parameter α
we trace out lines in phase space. Using the results (4.106) and (4.109), the tangent
vectors to these lines are given by,


$$\frac{dq_i}{d\alpha} = \frac{\partial G}{\partial p_i} \quad\text{and}\quad \frac{dp_i}{d\alpha} = -\frac{\partial G}{\partial q_i} \tag{4.111}$$


But these look just like Hamilton’s equations, with the Hamiltonian replaced by the
function G and time replaced by the parameter α. What we’ve found is that every
one-parameter family of canonical transformations can be thought of as “Hamiltonian
flow” on phase space for an appropriately chosen “Hamiltonian” G. Conversely, time
evolution can be thought of as a canonical transformation for the coordinates


$$(q_i(t_0), p_i(t_0)) \to (q_i(t), p_i(t)) \tag{4.112}$$


generated by the Hamiltonian. Once again, we see the link between time and the
Hamiltonian.
As an example, consider the function $G = p_k$. Then the corresponding infinitesimal
canonical transformation is $q_i \to q_i + \alpha\delta_{ik}$ and $p_i \to p_i$, which is simply a translation.
We say that translations of $q_k$ are generated by the conjugate momentum $G = p_k$.


4.4.2 Noether’s Theorem Revisited 


Recall that in the Lagrangian formalism, we saw a connection between symmetries and 
conservation laws. How does this work in the Hamiltonian formulation? 


Consider an infinitesimal canonical transformation generated by G. Then 


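$$\begin{aligned}
\delta H &= \frac{\partial H}{\partial q_i}\,\delta q_i + \frac{\partial H}{\partial p_i}\,\delta p_i \\
&= \alpha\,\frac{\partial H}{\partial q_i}\frac{\partial G}{\partial p_i} - \alpha\,\frac{\partial H}{\partial p_i}\frac{\partial G}{\partial q_i} + O(\alpha^2) \\
&= \alpha\,\{H, G\}
\end{aligned} \tag{4.113}$$


The generator G is called a symmetry of the Hamiltonian if δH = 0. This holds if

$$\{G, H\} = 0 \tag{4.114}$$


But we know from section 4.3 that $\dot G = \{G, H\}$. We have found that if G is a symmetry
then G is conserved. Moreover, we can reverse the argument. If we have a conserved
quantity G, then we can always use this to generate a canonical transformation which
is a symmetry.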
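To see a generator at work, the following sketch integrates the flow equations dq_i/dα = ∂G/∂p_i, dp_i/dα = −∂G/∂q_i numerically for G = x p_y − y p_x, the angular momentum about the z-axis in two dimensions. The flow should be a rotation by angle α, and a rotationally symmetric Hamiltonian should be unchanged along it. The finite-difference gradient, the Runge-Kutta stepper and the sample Hamiltonian are illustrative assumptions.

```python
import math

def grad(f, z, h=1e-5):
    """Central-difference gradient of f at the phase-space point z."""
    out = []
    for i in range(len(z)):
        up, dn = list(z), list(z)
        up[i] += h
        dn[i] -= h
        out.append((f(up) - f(dn))/(2*h))
    return out

def flow_rhs(G, z):
    """dq_i/da = dG/dp_i and dp_i/da = -dG/dq_i, with z = (q..., p...)."""
    n = len(z)//2
    dG = grad(G, z)
    return dG[n:] + [-d for d in dG[:n]]

def flow(G, z, alpha, steps=400):
    """Transport z along the flow generated by G for parameter alpha."""
    h = alpha/steps
    z = list(z)
    for _ in range(steps):
        k1 = flow_rhs(G, z)
        k2 = flow_rhs(G, [a + h/2*b for a, b in zip(z, k1)])
        k3 = flow_rhs(G, [a + h/2*b for a, b in zip(z, k2)])
        k4 = flow_rhs(G, [a + h*b for a, b in zip(z, k3)])
        z = [a + h/6*(b + 2*c + 2*d + e)
             for a, b, c, d, e in zip(z, k1, k2, k3, k4)]
    return z

# Phase-space point z = (x, y, p_x, p_y)
def L_z(z):
    return z[0]*z[3] - z[1]*z[2]

def H(z):
    """A rotationally symmetric Hamiltonian (unit mass, -1/r potential)."""
    return 0.5*(z[2]**2 + z[3]**2) - 1.0/math.hypot(z[0], z[1])

z0 = [1.0, 0.5, -0.2, 0.7]
z1 = flow(L_z, z0, 0.9)          # should rotate (x, y) and (p_x, p_y) by 0.9 rad
print(H(z0), H(z1))
```

Passing `lambda z: z[2]` (that is, G = p_x) to `flow` instead shifts x by α and leaves everything else alone, which is the translation example.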


4.4.3 Generating Functions 


There’s a simple method to construct canonical transformations between coordinates
$(q_i, p_i)$ and $(Q_i, P_i)$. Consider a function F(q, Q) of the original $q_i$’s and the final $Q_i$’s.
Let

$$p_i = \frac{\partial F}{\partial q_i} \tag{4.115}$$

After inverting, this equation can be thought of as defining the new coordinate $Q_i = Q_i(q, p)$.
But what is the new canonical momentum $P_i$? We’ll show that it’s given by


$$P_i = -\frac{\partial F}{\partial Q_i} \tag{4.116}$$

The proof of this is a simple matter of playing with partial derivatives. Let’s see how
it works in an example with just a single degree of freedom. (It generalises trivially to
the case of several degrees of freedom). We can look at the Poisson bracket

$$\{Q, P\} = \left.\frac{\partial Q}{\partial q}\right|_p\left.\frac{\partial P}{\partial p}\right|_q - \left.\frac{\partial Q}{\partial p}\right|_q\left.\frac{\partial P}{\partial q}\right|_p \tag{4.117}$$
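At this point we need to do the playing with partial derivatives. Equation (4.116)
defines P = P(q, Q), so we have


$$\left.\frac{\partial P}{\partial p}\right|_q = \left.\frac{\partial Q}{\partial p}\right|_q\left.\frac{\partial P}{\partial Q}\right|_q \quad\text{and}\quad \left.\frac{\partial P}{\partial q}\right|_p = \left.\frac{\partial P}{\partial q}\right|_Q + \left.\frac{\partial Q}{\partial q}\right|_p\left.\frac{\partial P}{\partial Q}\right|_q \tag{4.118}$$

Inserting this into the Poisson bracket gives

$$\{Q, P\} = -\left.\frac{\partial Q}{\partial p}\right|_q\left.\frac{\partial P}{\partial q}\right|_Q = \left.\frac{\partial Q}{\partial p}\right|_q\frac{\partial^2 F}{\partial q\,\partial Q} = \left.\frac{\partial Q}{\partial p}\right|_q\left.\frac{\partial p}{\partial Q}\right|_q = 1 \tag{4.119}$$


as required. The function F(q, Q) is known as a generating function of the first kind.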
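As a concrete instance, take the generating function F(q, Q) = ½ m ω q² cot Q, a standard textbook choice for the harmonic oscillator that is not introduced in these notes and is used here purely for illustration. Applying (4.115) and (4.116) gives p = m ω q cot Q and P = ½ m ω q² / sin² Q, which invert to q = √(2P/mω) sin Q and p = √(2Pmω) cos Q. The sketch below checks this with finite-difference derivatives; the parameter values are arbitrary.

```python
import math

M, W = 1.0, 2.0   # illustrative mass and frequency

def F1(q, Q):
    """Type-1 generating function F(q, Q) = (m w / 2) q^2 cot(Q)."""
    return 0.5*M*W*q*q/math.tan(Q)

def old_and_new_momentum(q, Q, h=1e-6):
    """Return (p, P) with p = dF/dq and P = -dF/dQ, by central differences."""
    p = (F1(q + h, Q) - F1(q - h, Q))/(2*h)
    P = -(F1(q, Q + h) - F1(q, Q - h))/(2*h)
    return p, P

q, Q = 0.4, 0.7
p, P = old_and_new_momentum(q, Q)
# Compare with the closed-form inversion quoted in the lead-in.
print(q, math.sqrt(2*P/(M*W))*math.sin(Q))
print(p, math.sqrt(2*P*M*W)*math.cos(Q))
```

Each printed pair should agree to roughly the accuracy of the finite differences.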


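There are three further types of generating function, related to the first by Legendre
transforms. Each is a function of one of the original coordinates and one of
the new coordinates. You can check that the following expressions all define canonical
transformations:

$$F_2(q, P):\quad p_i = \frac{\partial F_2}{\partial q_i} \quad\text{and}\quad Q_i = \frac{\partial F_2}{\partial P_i} \tag{4.120}$$
$$F_3(p, Q):\quad q_i = -\frac{\partial F_3}{\partial p_i} \quad\text{and}\quad P_i = -\frac{\partial F_3}{\partial Q_i}$$
$$F_4(p, P):\quad q_i = -\frac{\partial F_4}{\partial p_i} \quad\text{and}\quad Q_i = \frac{\partial F_4}{\partial P_i}$$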

4.5 Action-Angle Variables 


We've all tried to solve problems in physics using the wrong coordinates and seen 
what a mess it can be. If you work in Cartesian coordinates when the problem really 
requires, say, spherical polar coordinates, it’s always possible to get to the right answer 
with enough perseverance, but you’re really making life hard for yourself. The ability 
to change coordinate systems can drastically simplify a problem. Now we have a much 
larger set of transformations at hand; we can mix up q’s and p’s. An obvious question 
is: Is this useful for anything?! In other words, is there a natural choice of variables 
which makes solving a given problem much easier. In many cases, there is. They’re 
called “angle-action” variables. 


4.5.1 The Simple Harmonic Oscillator 


We’ll start this section by doing a simple example which will illustrate the main point. 
We’ll then move on to the more general theory. The example we choose is the simple 
harmonic oscillator. Notice that as our theory gets more abstract, our examples get 
easier! 
We have the Hamiltonian


$$H = \frac{p^2}{2m} + \frac{1}{2}m\omega^2 q^2 \tag{4.121}$$


so that Hamilton’s equations are the familiar


$$\dot p = -m\omega^2 q \quad\text{and}\quad \dot q = \frac{p}{m} \tag{4.122}$$

which has the rather simple solution

$$q = A\cos(\omega(t - t_0)) \quad\text{and}\quad p = -m\omega A\sin(\omega(t - t_0)) \tag{4.123}$$


where A and $t_0$ are integration constants. The flows in phase space are ellipses as shown
in the figure.


Figure 60:


Now let’s do a rather strange change of variables in which we use our freedom to mix 
up the position and momentum variables. We write 


$$(q, p) \to (\theta, I) \tag{4.124}$$


where you can think of θ as our new position coordinate and I as our new momentum
coordinate. The transformation we choose is:


$$q = \sqrt{\frac{2I}{m\omega}}\,\sin\theta \quad\text{and}\quad p = \sqrt{2Im\omega}\,\cos\theta \tag{4.125}$$


It’s an odd choice, but it has advantages! Before we turn to these, let’s spend a minute
checking that this is indeed a canonical transformation. There are two ways to do this
and we’ll do both:


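1) We can make sure that the Poisson brackets are preserved. In fact, it’s easier to
work backwards and check that {q, p} = 1 in (θ, I) coordinates. In other words, we
need to show that

$$\{q, p\}_{(\theta, I)} \equiv \frac{\partial q}{\partial\theta}\frac{\partial p}{\partial I} - \frac{\partial q}{\partial I}\frac{\partial p}{\partial\theta} = 1 \tag{4.126}$$


To confirm this, let’s substitute the transformation (4.125),


$$\{q, p\}_{(\theta, I)} = \left\{\sqrt{\frac{2I}{m\omega}}\,\sin\theta,\ \sqrt{2Im\omega}\,\cos\theta\right\}_{(\theta, I)} = 2\left\{\sqrt{I}\sin\theta,\ \sqrt{I}\cos\theta\right\}_{(\theta, I)} = 1 \tag{4.127}$$


where the final equality follows after a quick differentiation. So we see that the transformation
(4.125) is indeed canonical.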
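2) The second way to see that the transformation is canonical is to prove that the
Jacobian is symplectic. Let’s now check it this way. We can calculate


$$\mathcal{J} = \begin{pmatrix} \partial\theta/\partial q & \partial\theta/\partial p \\ \partial I/\partial q & \partial I/\partial p \end{pmatrix} = \begin{pmatrix} \frac{m\omega}{p}\cos^2\theta & -\frac{m\omega q}{p^2}\cos^2\theta \\ m\omega q & p/m\omega \end{pmatrix} \tag{4.128}$$

from which we can calculate $\mathcal{J}J\mathcal{J}^T$ and find that it is equal to J as required.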
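The symplectic check can also be done numerically, which is a useful habit when the algebra is less friendly. The sketch below inverts (4.125) to get θ and I as functions of (q, p), builds the Jacobian by central differences and forms the product with the symplectic matrix. The values of m and ω, the sample point and the step size are illustrative assumptions.

```python
import math

M, W = 1.0, 2.0   # illustrative mass and frequency

def angle_action(q, p):
    """Invert (4.125): tan(theta) = m w q / p and I = p^2/(2 m w) + m w q^2/2."""
    return math.atan2(M*W*q, p), p*p/(2*M*W) + M*W*q*q/2

def jacobian(q, p, h=1e-6):
    """Central-difference Jacobian d(theta, I)/d(q, p)."""
    tq1, iq1 = angle_action(q + h, p)
    tq0, iq0 = angle_action(q - h, p)
    tp1, ip1 = angle_action(q, p + h)
    tp0, ip0 = angle_action(q, p - h)
    return [[(tq1 - tq0)/(2*h), (tp1 - tp0)/(2*h)],
            [(iq1 - iq0)/(2*h), (ip1 - ip0)/(2*h)]]

def matmul(a, b):
    return [[sum(a[i][k]*b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def symplectic_product(jac):
    """Return Jac * J * Jac^T with J = [[0, 1], [-1, 0]]."""
    J = [[0.0, 1.0], [-1.0, 0.0]]
    jac_t = [[jac[j][i] for j in range(2)] for i in range(2)]
    return matmul(matmul(jac, J), jac_t)

print(symplectic_product(jacobian(0.3, 0.7)))
```

For a canonical transformation the printed matrix should be close to [[0, 1], [-1, 0]]. For one degree of freedom this is equivalent to the Jacobian having unit determinant.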


So we have a canonical transformation in (4.125). But what’s the point of doing 
this? Let’s look at the Hamiltonian in our new variables. 


$$H = \frac{1}{2m}\,(2m\omega I)\cos^2\theta + \frac{1}{2}m\omega^2\,\frac{2I}{m\omega}\,\sin^2\theta = \omega I \tag{4.129}$$

so the Hamiltonian doesn’t depend on the variable θ! This means
that Hamilton’s equations read

$$\dot\theta = \frac{\partial H}{\partial I} = \omega \quad\text{and}\quad \dot I = -\frac{\partial H}{\partial\theta} = 0 \tag{4.130}$$

We’ve managed to map the phase space flow onto a cylinder parameterised
by θ and I so that the flows are now all straight lines as
shown in the figure. The coordinates (θ, I) are examples of angle-action
variables.


Figure 61:


4.5.2 Integrable Systems 


In the above example, we saw that we could straighten out the flow lines of the simple 
harmonic oscillator with a change of variables, so that the motion in phase space became 
trivial. It’s interesting to ask whether we can do this generally. The answer is: only for
certain systems that are known as integrable. 


Suppose we have n degrees of freedom. We would like to find canonical transformations


$$(q_i, p_i) \to (\theta_i, I_i) \tag{4.131}$$


such that the Hamiltonian becomes $H = H(I_1, \ldots, I_n)$ and doesn’t depend on $\theta_i$. If we
can do this, then Hamilton’s equations tell us that we have n conserved quantities $I_i$,
while

$$\dot\theta_i = \frac{\partial H}{\partial I_i} = \omega_i \tag{4.132}$$

where $\omega_i$ is independent of θ (but in general depends on I) so that the solutions are
simply $\theta_i = \omega_i t$. Whenever such a transformation exists, the system is said to be
integrable. For bounded motion, the $\theta_i$ are usually scaled so that $0 \le \theta_i < 2\pi$ and the
coordinates $(\theta_i, I_i)$ are called angle-action variables.
Liouville’s Theorem on Integrable Systems: There is a converse statement.
If we can find n mutually Poisson commuting constants of motion $I_1, \ldots, I_n$ then this
implies the existence of angle-action variables and the system is integrable. The requirement
of Poisson commutation $\{I_i, I_j\} = 0$ is the statement that we can view the
$I_i$ as canonical momentum variables. This is known as Liouville’s theorem. (Same
Liouville, different theorem). A proof can be found in the book by Arnold.


Don’t be fooled into thinking all systems are integrable. They are rather special and 
precious. It remains an active area of research to find and study these systems. But 
many — by far the majority — of systems are not integrable (chaotic systems notably 
among them) and don’t admit this change of variables. Note that the question of 
whether angle-action variables exist is a global one. Locally you can always straighten 
out the flow lines; it’s a question of whether you can tie these straight lines together 
globally without them getting tangled. 


Clearly the motion of a completely integrable system is restricted to lie on $I_i$ =
constant slices of the phase space. A theorem in topology says that these surfaces must
be tori ($S^1 \times \cdots \times S^1$) known as the invariant tori.


4.5.3 Action-Angle Variables for 1d Systems 


Let’s see how this works for a 1d system with Hamiltonian


$$H = \frac{p^2}{2m} + V(q) \tag{4.133}$$


Since H itself is a constant of motion, with H = E for
some constant E throughout the motion, the system is
integrable. We assume that the motion is bounded so that
$q_1 \le q \le q_2$ as shown in the figure. Then the motion is
periodic, oscillating back and forth between the two end points, and the motion in phase
space looks something like the figure 63. Our goal is to find a canonical transformation
to variables θ and I that straightens out this flow to look like the second figure in the
diagram.


Figure 62:


So what are I and θ? Since I is a constant of motion, it should be some function of
the energy or, alternatively,


$$H = H(I) = E \tag{4.134}$$
Figure 63: Can we straighten out the flow lines in phase space?


But which choice will have as its canonical partner θ ∈ [0, 2π) satisfying

$$\dot\theta = \frac{\partial H}{\partial I} = \frac{\partial E}{\partial I} \equiv \omega \tag{4.135}$$

for a constant ω which is the frequency of the orbit?


Claim: The correct choice for I is

$$I = \frac{1}{2\pi}\oint p\,dq \tag{4.136}$$


which is the area of phase space enclosed by an orbit (divided by 2π) and is a function
of the energy only.


Proof: Since the Hamiltonian is conserved, we may write the momentum as a function
of q and E:


$$p = \sqrt{2m}\,\sqrt{E - V(q)} \tag{4.137}$$


We know that for this system $p = m\dot q$ so we have


$$dt = \sqrt{\frac{m}{2}}\,\frac{dq}{\sqrt{E - V(q)}} \tag{4.138}$$


Integrating over a single orbit with period T = 2π/ω, we have


$$\frac{2\pi}{\omega} = \oint\sqrt{\frac{m}{2}}\,\frac{dq}{\sqrt{E - V(q)}} = \oint\sqrt{2m}\left(\frac{d}{dE}\sqrt{E - V(q)}\right)dq \tag{4.139}$$


At this point we take the differentiation d/dE outside the integral. This isn’t obviously
valid since the path around which the integral is evaluated itself changes with energy
E. Shortly we’ll show that this doesn’t matter. For now, let’s assume that this is valid
and continue to find


$$\frac{2\pi}{\omega} = \frac{d}{dE}\oint\sqrt{2m}\,\sqrt{E - V(q)}\;dq = \frac{d}{dE}\oint p\,dq = 2\pi\,\frac{dI}{dE} \tag{4.140}$$


where in the last line, we’ve substituted for our putative action variable I. Examining
our end result, we have found that I does indeed satisfy

$$\frac{dE}{dI} = \omega \tag{4.141}$$

where ω is the frequency of the orbit. This is our required
result, but it remains to show that we didn’t
miss anything by taking d/dE outside the integral.
Let’s think about this. We want to see how the area
enclosed by the curve changes under a small shift in
energy δE. Both the curve itself and the end points
$q_1 \le q \le q_2$ vary as the energy shifts. The latter
change by $\delta q_i = (dV(q_i)/dq)^{-1}\,\delta E$. Allowing the differential
d/dE to wander inside and outside the integral
is tantamount to neglecting the change in the
end points. The piece we’ve missed is the small white
region in the figure. But these pieces are of order δE². To see this, note that order δE
pieces are given by


$$\int_{q_i}^{q_i + \delta q_i}\sqrt{2m}\,\sqrt{E - V(q)}\;dq \approx \sqrt{2m}\,\sqrt{E - V(q)}\left(\frac{\partial q_i}{\partial E}\right)\delta E \tag{4.142}$$


evaluated at the end point $q = q_i$. They vanish because $E = V(q_i)$ at the end points.


Figure 64:


This completes the proof.


This tells us that we can calculate the frequency ω of the orbit, and hence its period
T = 2π/ω, by figuring out the area enclosed by the orbit in phase space as a function
of the energy. Notice that we can
do this without ever having to work out the angle variable θ (which is a complicated
function of q and p) which travels with constant speed around the orbit (i.e. satisfies
θ = ωt).
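Here is a numerical illustration for a system where the answer is not obvious: the quartic oscillator H = p²/2 + q⁴ (unit mass), chosen purely as an example. The sketch computes the action I(E) as an area integral, differentiates it numerically to get ω = dE/dI, and compares with the period obtained directly from ∮ dq/q̇. Both integrals use the substitution q = a sin u with a = E^{1/4}, which removes the square-root behaviour at the turning points; the midpoint rule and the step sizes are assumptions of this sketch.

```python
import math

def _midpoint(f, a, b, n=2000):
    """Composite midpoint rule for the integral of f over [a, b]."""
    h = (b - a)/n
    return h*sum(f(a + (k + 0.5)*h) for k in range(n))

def action(E):
    """I(E) = (1/2 pi) * closed integral of p dq for H = p^2/2 + q^4."""
    a = E**0.25                                   # turning points at q = +/- a
    f = lambda u: math.cos(u)**2*math.sqrt(1 + math.sin(u)**2)
    return math.sqrt(2)*a**3/math.pi*_midpoint(f, -math.pi/2, math.pi/2)

def period(E):
    """T(E) = closed integral of dq / (dq/dt), using the same substitution."""
    a = E**0.25
    f = lambda u: 1/math.sqrt(1 + math.sin(u)**2)
    return math.sqrt(2)/a*_midpoint(f, -math.pi/2, math.pi/2)

def frequency_from_action(E, rel_step=1e-4):
    """omega = dE/dI from a central difference of I(E)."""
    h = rel_step*E
    return 2*h/(action(E + h) - action(E - h))

E = 1.7
print(frequency_from_action(E), 2*math.pi/period(E))
```

The two printed frequencies should coincide up to the finite-difference error, even though the first never looks at the time dependence of the motion.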
In fact, it’s not too hard to get an expression for θ by going over the above analysis
for a small part of the period. It follows from the above proof that


$$t = \frac{d}{dE}\int p\,dq \tag{4.143}$$


but we want a θ which obeys θ = ωt. We see that we can achieve this by taking the
choice


$$\theta = \omega\,\frac{d}{dE}\int p\,dq = \frac{dE}{dI}\,\frac{d}{dE}\int p\,dq = \frac{d}{dI}\int p\,dq \tag{4.144}$$


Because E is conserved, all 1d systems are integrable. What about higher dimensional
systems? If they are integrable, then there exists a change to angle-action variables
given by


$$I_i = \frac{1}{2\pi}\oint_{\gamma_i}\sum_j p_j\,dq_j$$
$$\theta_i = \frac{\partial}{\partial I_i}\int\sum_j p_j\,dq_j \tag{4.145}$$


where the $\gamma_i$ are the periods of the invariant tori.


4.5.4 Action-Angle Variables for the Kepler Problem 


Perhaps the simplest integrable system with more than one degree of freedom is the 
Kepler problem. This is a particle of mass m moving in three dimensions, subject to 
the potential 


$$V(r) = -\frac{k}{r}$$

We solved this already back in the Dynamics and Relativity course. Recall that we can
use the conservation of the (direction of) angular momentum to restrict dynamics to
a two-dimensional plane. We’ll work in polar coordinates (r, φ) in this spatial plane.
The associated momenta are $p_r = m\dot r$ and $p_\phi = mr^2\dot\phi$. The Hamiltonian is


$$H = \frac{1}{2m}\,p_r^2 + \frac{1}{2mr^2}\,p_\phi^2 - \frac{k}{r} \tag{4.146}$$


There are two action variables, one associated to the radial motion and one associated
to the angular motion. The latter is straightforward: it is the angular momentum itself


$$I_\phi = \frac{1}{2\pi}\oint p_\phi\,d\phi = \frac{1}{2\pi}\int_0^{2\pi} p_\phi\,d\phi = p_\phi$$


The action variable for the radial motion is more interesting. We can calculate it
by using the fact that the total energy, E, and the angular momentum $I_\phi$ are both
conserved. Then, rearranging (4.146), we have

$$p_r^2 = 2m\left(E + \frac{k}{r}\right) - \frac{I_\phi^2}{r^2}$$

and the action variable is


$$I_r = \frac{1}{2\pi}\oint p_r\,dr = \frac{1}{2\pi}\,2\int_{r_{\min}}^{r_{\max}} p_r\,dr = \frac{1}{2\pi}\,2\int_{r_{\min}}^{r_{\max}}\sqrt{2m\left(E + \frac{k}{r}\right) - \frac{I_\phi^2}{r^2}}\;dr$$
Here $r_{\min}$ and $r_{\max}$ are, respectively, the closest and furthest distance to the origin.
(If you try to picture this in space, you’ll need to recall that in the Kepler problem
the origin sits on the focus of the ellipse, rather than the centre; this means that the
smallest and furthest distance are opposite each other on the orbit). The factor of 2 in
the second equality comes because a complete cycle goes from $r_{\min}$ to $r_{\max}$ and back
again. To do this integral, you’ll need the result


$$\int_{r_{\min}}^{r_{\max}}\sqrt{\left(1 - \frac{r_{\min}}{r}\right)\left(\frac{r_{\max}}{r} - 1\right)}\;dr = \frac{\pi}{2}\,(r_{\min} + r_{\max}) - \pi\sqrt{r_{\min}\,r_{\max}}$$


Using this, we find

$$I_r = \sqrt{\frac{m}{2|E|}}\;k - I_\phi$$

Or, re-arranging,

$$E = -\frac{mk^2}{2(I_r + I_\phi)^2} \tag{4.147}$$


There’s something rather nice lurking in this result. The energy is the same as the
Hamiltonian in this case and we can use it to compute the speed at which the angular
variables change. This follows from Hamilton’s equations,


$$\dot\theta_r = \frac{\partial H}{\partial I_r} \quad\text{and}\quad \dot\theta_\phi = \frac{\partial H}{\partial I_\phi}$$


Here $\theta_\phi = \phi$ while $\theta_r$ is some complicated function of r. But we see from (4.147) that
the Hamiltonian is symmetric in $I_r$ and $I_\phi$. This means that the frequency at which the
particle completes a φ cycle is the same frequency with which it completes a $\theta_r$ cycle.
But that’s the statement that the orbit is closed: when you go around 2π in space, you
come back to the same r value. Having every bound orbit closed is a very special feature of
the 1/r potential (among central potentials, only the harmonic oscillator shares it). The calculation reveals the underlying reason for this.
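The radial action can be checked by direct quadrature. In the sketch below the turning points are taken from the roots of p_r² = 0, namely r_min + r_max = k/|E| and r_min r_max = I_φ²/(2m|E|), and the substitution r = ½(r_min + r_max) − ½(r_max − r_min) cos u makes the integrand smooth. The function names, the midpoint rule and the sample values are illustrative assumptions.

```python
import math

def radial_action(E, I_phi, m=1.0, k=1.0, n=2000):
    """I_r = (1/2 pi) * closed integral of p_r dr for a bound orbit (E < 0)."""
    half_sum = k/(2*abs(E))                       # (r_min + r_max)/2
    product = I_phi**2/(2*m*abs(E))               # r_min * r_max
    half_gap = math.sqrt(half_sum**2 - product)   # (r_max - r_min)/2
    # With r = half_sum - half_gap*cos(u), 0 <= u <= pi, one finds
    # p_r dr = sqrt(2 m |E|) * (half_gap*sin(u))**2 / r du.
    du = math.pi/n
    total = 0.0
    for j in range(n):
        u = (j + 0.5)*du
        r = half_sum - half_gap*math.cos(u)
        total += (half_gap*math.sin(u))**2/r*du
    # Factor 2 (out and back) over 2 pi leaves 1/pi.
    return math.sqrt(2*m*abs(E))*total/math.pi

def energy_from_actions(I_r, I_phi, m=1.0, k=1.0):
    """E = -m k^2 / (2 (I_r + I_phi)^2)."""
    return -m*k*k/(2*(I_r + I_phi)**2)

E, I_phi = -0.3, 0.8
I_r = radial_action(E, I_phi)
print(I_r, energy_from_actions(I_r, I_phi))
```

The second printed number should reproduce the input energy, confirming that E depends on the actions only through the sum I_r + I_φ.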




4.6 Adiabatic Invariants 


Consider a 1d system with a potential V(q) that depends
on some parameter λ. If the motion is bounded by
the potential then it is necessarily periodic. We want to
ask what happens if we slowly change λ over time. For
example, we may slowly change the length of a pendulum,
or the frequency of the harmonic oscillator.


Figure 65:


Since we now have λ = λ(t), the energy is not conserved.
Rather E = E(t) where

$$\dot E = \frac{\partial H}{\partial\lambda}\,\dot\lambda \tag{4.148}$$

But there are combinations of E and λ which remain (approximately) constant. These
are called adiabatic invariants and the purpose of this section is to find them. In fact,
we’ve already come across them: we’ll see that the adiabatic invariants are the action
variables of the previous section.


For the 1d system, the Hamiltonian is

$$H = \frac{p^2}{2m} + V(q; \lambda(t)) \tag{4.149}$$

and we claim that the adiabatic invariant is


$$I = \frac{1}{2\pi}\oint p\,dq \tag{4.150}$$


where the path in phase space over which we integrate now depends on time and is
given by $p = \sqrt{2m}\,\sqrt{E(t) - V(q; \lambda(t))}$. The purpose of this section is to show that I
is indeed an adiabatic invariant. At the same time we will also make clearer what we
mean when we say that λ must change slowly.


Let’s start by thinking of I as a function of the energy E and the parameter λ. As
we vary either of these, I will change. We have,


$$\dot I = \left.\frac{\partial I}{\partial E}\right|_\lambda\dot E + \left.\frac{\partial I}{\partial\lambda}\right|_E\dot\lambda \tag{4.151}$$

where the subscripts on the partial derivatives tell us what variable we’re keeping fixed.
For an arbitrary variation of E and λ, this equation tells us that I also changes. But,
of course, E and λ do not change arbitrarily: they are related by (4.148). The point of
the adiabatic invariant is that when $\dot E$ and $\dot\lambda$ are related in this way, the two terms in
(4.151) approximately cancel out. We can deal with each of these terms in turn. The
first term is something we’ve seen previously in equation (4.141) which tells us that,
$$\left.\frac{\partial I}{\partial E}\right|_\lambda = \frac{1}{\omega(\lambda)} = \frac{T(\lambda)}{2\pi} \tag{4.152}$$

where T(λ) is the period of the system evaluated at fixed λ.
The second term in (4.151) tells us how the path changes
as λ is varied. For example, two possible paths for two
different λ’s are shown in the figure and the change in I is
the change in the area under the two curves.


Figure 66:


We have

$$\left.\frac{\partial I}{\partial\lambda}\right|_E = \frac{1}{2\pi}\left.\frac{\partial}{\partial\lambda}\right|_E\oint p\,dq = \frac{1}{2\pi}\oint\left.\frac{\partial p}{\partial\lambda}\right|_E dq = \frac{1}{2\pi}\int_0^{T(\lambda)}\left.\frac{\partial p}{\partial\lambda}\right|_E\left.\frac{\partial H}{\partial p}\right|_\lambda dt' \tag{4.153}$$


where, in the second equality, we have neglected a contribution arising from the fact
that the path around which we integrate changes as λ changes. But this contribution
can be safely ignored by the same argument given around (4.142).


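We can get a simple expression for the product of partial derivatives by differentiating
the Hamiltonian and remembering what depends on what. We have the expression
H(q, p, λ) = E where, in the left-hand side we substitute $p = \sqrt{2m}\,\sqrt{E(t) - V(q; \lambda(t))}$.
Then differentiating with respect to λ, keeping E (and q) fixed, we have


$$\left.\frac{\partial H}{\partial\lambda}\right|_p + \left.\frac{\partial H}{\partial p}\right|_\lambda\left.\frac{\partial p}{\partial\lambda}\right|_E = 0 \tag{4.154}$$

So substituting this into (4.153) we have

$$\left.\frac{\partial I}{\partial\lambda}\right|_E = -\frac{1}{2\pi}\int_0^{T(\lambda)}\left.\frac{\partial H}{\partial\lambda}\right|_p dt' \tag{4.155}$$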


So putting it all together, we have the time variation of I given by


$$\dot I = \left[T(\lambda)\left.\frac{\partial H}{\partial\lambda}\right|_p - \int_0^{T(\lambda)}\left.\frac{\partial H}{\partial\lambda}\right|_p dt'\right]\frac{\dot\lambda}{2\pi} \tag{4.156}$$


where, in the first term, we’ve replaced $\dot E$ with the expression (4.148). Now we’re
almost done. So far, each term on the right-hand side is evaluated at a given time t or,
correspondingly for a given λ(t). The two terms look similar, but they don’t cancel!


But we have yet to make use of the fact that the change in λ is slow. At this point
we can clarify what we mean by this. The basic idea is that the speed at which the
particle bounces backwards and forwards in the potential is much faster than the speed
at which λ changes. This means that the particle has performed many periods before
it notices any appreciable change in the potential. This means that if we compute
averaged quantities over a single period,


$$\langle A(\lambda)\rangle = \frac{1}{T}\int_0^T A(t, \lambda)\,dt \tag{4.157}$$


then inside the integral we may treat λ as if it is effectively constant. We now consider
the time averaged motion $\langle\dot I\rangle$. Since λ can be taken to be constant over a single period,
the two terms in (4.156) do now cancel. We have


$$\langle\dot I\rangle = 0 \tag{4.158}$$

This is the statement that I is an adiabatic invariant: for slow changes in λ, the
averaged value of I remains constant⁷.
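A quick experiment shows the invariance in action. The sketch below takes a unit-mass harmonic oscillator whose frequency is ramped linearly from 1 to 2, ω(t) = 1 + εt, and compares the energy E with the ratio E/ω (the action of the oscillator) before and after the ramp. The linear ramp, the Runge-Kutta integrator and the step size are illustrative assumptions; the residual drift in E/ω is expected to shrink as ε is made smaller.

```python
def ramp_oscillator(eps, dt=0.02):
    """Integrate q'' = -w(t)^2 q with w(t) = 1 + eps*t until w reaches 2.

    Returns (E_start, E_end, I_start, I_end) with I = E/w and unit mass.
    """
    w = lambda t: 1.0 + eps*t

    def rhs(t, q, p):
        return p, -w(t)**2*q

    t, q, p = 0.0, 1.0, 0.0
    E_start = 0.5*p*p + 0.5*w(t)**2*q*q
    for _ in range(int(round(1.0/(eps*dt)))):
        k1 = rhs(t, q, p)
        k2 = rhs(t + dt/2, q + dt/2*k1[0], p + dt/2*k1[1])
        k3 = rhs(t + dt/2, q + dt/2*k2[0], p + dt/2*k2[1])
        k4 = rhs(t + dt, q + dt*k3[0], p + dt*k3[1])
        q += dt/6*(k1[0] + 2*k2[0] + 2*k3[0] + k4[0])
        p += dt/6*(k1[1] + 2*k2[1] + 2*k3[1] + k4[1])
        t += dt
    E_end = 0.5*p*p + 0.5*w(t)**2*q*q
    return E_start, E_end, E_start/w(0.0), E_end/w(t)

print(ramp_oscillator(0.01))
```

In a slow ramp the energy roughly doubles while E/ω barely moves, which is exactly the distinction between a conserved quantity and an adiabatic invariant.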


The adiabatic invariants played an important role in the early history of quantum
mechanics. You might recognise the quantity I as the object which takes values that are
integer multiples of ħ according to the old 1915 Bohr-Sommerfeld quantisation condition


$$\frac{1}{2\pi}\oint p\,dq = n\hbar, \qquad n \in \mathbb{Z} \tag{4.159}$$


The idea that adiabatic invariants and quantum mechanics are related actually predates
the Bohr-Sommerfeld quantisation rule. In the 1911 Solvay conference Einstein answered
a question of Lorentz: if the energy is quantised as E = ħnω where n ∈ Z then what
happens if ω is changed slowly? Lorentz’ worry was that integers cannot change slowly,
only by integer amounts. Einstein’s answer was not to worry: E/ω remains constant.
These days the idea of adiabatic invariants in quantum theory enters into the discussion
of quantum computers.


An Example: The Simple Harmonic Oscillator 


We saw in section 4.5 that for the simple harmonic oscillator we have I = E/ω. So
if we change ω slowly, then the ratio E/ω remains constant. This was Einstein’s 1911
point. In fact, for the SHO it turns out that there is an exact invariant that remains
constant no matter how quickly you change ω and which, in the limit of slow change,
goes over to I. This exact invariant is

$$ J = \frac{1}{2}\left[ \frac{q^2}{g(t)^2} + \left( g(t)\,\dot{q} - \dot{g}(t)\, q \right)^2 \right] \qquad (4.160) $$

where g(t) is a function satisfying the differential equation

$$ \ddot{g} + \omega^2(t)\, g - \frac{1}{g^3} = 0 \qquad (4.161) $$

⁷The proof given above is intuitive, but begins to creak at the seams when pushed. A nice
description of these issues, together with a more sophisticated proof using generating functions
for canonical transformations, is given in the paper “The Adiabatic Invariance of the Action
Variable in Classical Dynamics” by C.G.Wells and S.T.Siklos which can be found at
http://arxiv.org/abs/physics/0610084.
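
The statement that E/ω stays put under a slow change of ω is easy to test numerically. The following is only a rough sketch under assumptions that are not in the notes: unit mass, a linear ramp of ω from 1 to 2, and a simple leapfrog integrator. The function names (evolve, action, relative_change_in_I) are made up for illustration.

```python
def evolve(omega, t_end, dt=0.005, q=1.0, p=0.0):
    """Leapfrog (kick-drift-kick) integration of q'' = -omega(t)^2 q, with m = 1."""
    steps = int(round(t_end / dt))
    for k in range(steps):
        t = k * dt
        p -= 0.5 * dt * omega(t) ** 2 * q        # half kick using omega at time t
        q += dt * p                               # full drift
        p -= 0.5 * dt * omega(t + dt) ** 2 * q    # half kick using omega at time t + dt
    return q, p


def action(q, p, w):
    """I = E / omega for H = p^2/2 + w^2 q^2/2 (unit mass is an assumption)."""
    return (0.5 * p * p + 0.5 * w * w * q * q) / w


def relative_change_in_I(ramp_time):
    """Ramp omega linearly from 1 to 2 over ramp_time; compare I before and after."""
    omega = lambda t: 1.0 + t / ramp_time
    q0, p0 = 1.0, 0.0
    q1, p1 = evolve(omega, ramp_time, q=q0, p=p0)
    i0 = action(q0, p0, omega(0.0))
    i1 = action(q1, p1, omega(ramp_time))
    return abs(i1 - i0) / i0


slow = relative_change_in_I(500.0)   # many oscillations during the ramp
fast = relative_change_in_I(0.5)     # ramp much shorter than one period
print(f"slow ramp: relative change in I = {slow:.2e}")
print(f"fast ramp: relative change in I = {fast:.2e}")
```

With these choices the slow ramp should leave I essentially unchanged while the energy itself roughly doubles, whereas the fast ramp changes I by an amount of order one, even though ω starts and ends at the same values in both cases.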
4.6.1 Adiabatic Invariants and Liouville’s Theorem 


There’s a way to think of adiabatic invariants using Liouville’s theorem. Consider first
a series of systems, all described by a Hamiltonian with fixed parameter λ. We set off
each system with the same energy E or, equivalently, the same action I, but we start
them with slightly different phases θ. This means that their dynamics is described by a
series of dots, all chasing each other around a fixed curve as shown in the figure. Now
let’s think about how this train of dots evolves under the Hamiltonian with time
dependent λ(t). Recall that Liouville’s theorem states that the area of phase space is
invariant under any Hamiltonian evolution. This holds whether or not ∂H/∂t = 0, so is
still valid for the time dependent Hamiltonian with λ(t). One might be tempted to say
that we’re done since all the words sound right: Liouville’s theorem implies that the
area is conserved which is also the statement that our adiabatic invariant I doesn’t
change with time. But this is a little too fast! Liouville’s theorem says the area of a
distribution of particles in phase space is conserved, not the area enclosed by a
perimeter ring of particles. Indeed, Liouville’s theorem holds for any variation λ(t),
not just for adiabatic variations. For a fast change of λ(t), there is nothing to ensure
that the particles that had the same initial energy, but different phases, would have
the same final energy and we lose the interpretation of a ring of dots in phase space
enclosing some area.

[Figure 67: sketch in the (q, p) plane]


The missing ingredient is the “adiabatic principle”. In this context it states that for
a suitably slow change of the parameter λ, all the systems in the same orbit, with the
same energy, are affected in the same manner. If this holds, after some time the dots
in phase space will still be chasing each other around another curve of constant energy
E′. We can now think of a distribution of particles filling the area I inside the curve.
As λ varies slowly, the area doesn’t change and the outer particles remain the outer
particles, all with the same energy. Under these circumstances, Liouville’s theorem
implies the adiabatic invariant I is constant in time.


4.6.2 An Application: A Particle in a Magnetic Field 


We saw in Section 4.1 that a particle in a constant magnetic field B = (0, 0, B) makes
circles with Larmor frequency ω = eB/mc and a radius R, which depends on the energy
of the particle. But what happens if B is slowly varying over space? i.e. B = B(x, y),
but with

$$ \partial_i B \ll B/R \qquad (4.162) $$

so that the field is roughly constant over one orbit.


In this example, there is no explicit time dependence of the Hamiltonian so we know
that the Hamiltonian itself is an exact constant of motion. For a particle in a constant
magnetic field we can calculate H of an orbit by substituting the solutions (4.32) into
the Hamiltonian (4.28). We find

$$ H = \frac{1}{2} m\omega^2 R^2 = \frac{e^2 R^2 B^2}{2 m c^2} \qquad (4.163) $$

This quantity is conserved. But what happens to the particle? Does it drift to regions
with larger magnetic field B, keeping H constant by reducing the radius of the orbit?
Or to regions with smaller B with larger orbits?


We can answer this by means of the adiabatic invariant. We can use this because the
motion of the particle is periodic in space so the particle sees a magnetic field which
varies slowly over time. The adiabatic invariant is

$$ I = \frac{1}{2\pi}\oint p\, dq \qquad (4.164) $$

which is now to be thought of as a line integral along the orbit of the electron. We
evaluate this on the solution for the uniform magnetic field (4.32)

$$ I = \frac{1}{2\pi}\int_0^T (p_x \dot{x} + p_y \dot{y})\, dt $$
$$ \;\; = \frac{1}{2\pi}\int_0^T \left( b R\omega\cos\omega t + m\omega^2 R^2 \sin^2\omega t \right) dt $$
$$ \;\; = \frac{m\omega R^2}{2\pi}\int_0^{2\pi} \sin^2\theta\, d\theta \qquad (4.165) $$

Setting ω = eB/mc, we see that the adiabatic invariant I is proportional to (e/c)BR².
Since the electric charge e and the speed of light c are not things we can change, we find
that BR² is constant over many orbits. But as H ∼ B²R² is also conserved, both the
magnetic field B seen by the particle and the radius of the orbit R must be individually
conserved. This means the particle can’t move into regions of higher or lower magnetic
fields: it must move along constant field lines⁸.
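
As a quick sanity check on (4.165), the line integral can be done numerically on a Larmor circle. This is a sketch under stated assumptions rather than anything taken from the notes: it uses the orbit x = a + R sin ωt, y = b + R cos ωt, and assumes the gauge A = (−By, 0, 0), so that p_x = m ẋ − mωy and p_y = m ẏ once we set eB/c = mω. The function name is hypothetical.

```python
import math


def action_on_larmor_orbit(m, omega, R, b=-0.7, n=20000):
    """(1/2pi) * integral over one period of (p_x xdot + p_y ydot) dt, by the midpoint rule."""
    T = 2.0 * math.pi / omega
    dt = T / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * dt
        y = b + R * math.cos(omega * t)
        xdot = R * omega * math.cos(omega * t)
        ydot = -R * omega * math.sin(omega * t)
        px = m * xdot - m * omega * y   # assumed gauge A = (-B y, 0, 0), with e B / c = m omega
        py = m * ydot
        total += (px * xdot + py * ydot) * dt
    return total / (2.0 * math.pi)


m, omega, R = 1.3, 2.0, 0.5
I_numeric = action_on_larmor_orbit(m, omega, R)
I_formula = 0.5 * m * omega * R ** 2    # (m omega R^2 / 2pi) * integral of sin^2 over 0..2pi
print(I_numeric, I_formula)
```

The term that depends on the centre of the orbit b is proportional to cos ωt and averages to zero over a period, so only the sin² term survives and I = mωR²/2 = (e/2c)BR², whatever the position of the orbit.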


⁸For results that go beyond the adiabatic approximations, see the paper by the man: E. Witten
“A Slowly Moving Particle in a Two-Dimensional Magnetic Field”, Annals of Physics 120 72 (1979).

Finally, there’s a simple physical way to see that the particle indeed drifts along lines
of constant magnetic field. As the particle makes its little Larmor circles, it feels a
slightly stronger force when it’s, say, at the top of its orbit where the field is slightly
larger, compared to when it’s at the bottom. This net force tends to push the particle
to regions of weaker or stronger magnetic field. But we’ve seen through the use of
adiabatic invariants that this isn’t possible. The slow drift of the particle acts such
that it compensates for this small force, keeping the particle on constant field lines.


There’s a slight variant of the above set-up which allows you to trap charged particles
using magnetic fields. Consider the particle making its little Larmor circles in the (x, y)
plane, but also moving in the z direction and take a magnetic field that’s constant in
the (x, y)-plane, but ever increasing in the z-direction. The energy of the particle is
given by,

$$ H = \frac{1}{2} m\dot{z}^2 + \frac{e^2 R^2 B^2}{2mc^2} = \frac{1}{2} m\dot{z}^2 + \frac{I e B}{m c} \qquad (4.166) $$

Both H > 0 and I > 0 are constant in time. This ensures that there exists a value of
the magnetic field B > 0 at which we necessarily have $\dot{z} = 0$ and the particle can go
no further. At this stage it turns round and goes back again. By creating a magnetic
field that increases at two ends, charged particles can be made to bounce back and
forth in the z direction, while executing their little circles in the (x, y)-plane. It is this
mechanism that traps charged particles in magnetic loops emitted from the sun and is
ultimately responsible for solar flares.
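
To make the turning point explicit, here is the one-line computation implied by (4.166). The symbol B_max for the field strength at the turning point is introduced here purely for illustration.

```latex
\dot{z} = 0
\quad\Longrightarrow\quad
H = \frac{I e B_{\max}}{m c}
\quad\Longrightarrow\quad
B_{\max} = \frac{m c\, H}{e\, I}
```

Since H and I are both fixed by the initial conditions, so is B_max: the particle is reflected as soon as it reaches a region where the field has grown to this value.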


4.6.3 Hannay’s Angle 


Consider a particle bouncing around, with periodic motion, in an arbitrary potential.
There are many parameters λ_a describing the shape of the potential. As we slowly vary
the λ_a, the path in phase space changes although, as we have seen, the area enclosed
by the path remains the same. After some time t_long (which, by the assumption of
adiabaticity, is much longer than the period T of a single orbit) we return to the original
parameters so that λ_a(t_long) = λ_a(0). The question we want to ask is: how has the
phase angle θ changed?

For any fixed λ_a, the velocity of the angle variable is $\dot\theta = \partial H/\partial I = \omega(I, \lambda_a)$. As
we slowly vary the parameters, the particle is spinning around its phase space orbits.
When we return we therefore expect that the phase has been shifted by ∫ω dt. Which
is true. But it turns out that there is another, more subtle, contribution to the phase
shift as well. We’ll now see where this comes from.

As the parameters change, we can write the change in the angle θ as


$$ \dot\theta = \frac{\partial H}{\partial I} + \frac{\partial\theta}{\partial\lambda_a}\dot\lambda_a \qquad (4.167) $$

Which looks simple enough. But there’s a problem. The second term is not well defined.
For each set of parameters λ_a we have different action angle variables I(λ_a) and θ(λ_a).
But there’s nothing that stops us choosing a different origin θ = 0 for each choice of the
parameters. In other words, we could always redefine

$$ \theta(\lambda_a) \to \theta(\lambda_a) + \beta(\lambda_a) \qquad (4.168) $$

where we shift by a different constant β for each λ_a. What this means is that it doesn’t
really make any sense to compare the angle variable for different parameters λ_a. This
makes the second term, which tells us how the angle variable changes as we change
the parameters, ambiguous. One might think this means that we can just ignore it.
Or, more precisely, we could choose the shifts β so that the angle variables are defined
in such a way that the second term vanishes. But it turns out that this isn’t possible.
Let’s see why. The point is that it does make sense to compare the angle variable for
the same parameters λ_a. After such a time t_long, we have


$$ \theta(t_{long}) = \theta(0) + \int_0^{t_{long}} \omega\, dt + \Delta\theta \qquad (4.169) $$

The term ∫ω dt is the dynamic term that we anticipated above, arising from the fact
that θ is continually making orbits around the curve in phase space. It depends on the
time t_long that we took to make the change. The other term that we call Δθ is more
interesting. From (4.167) it is given by

$$ \Delta\theta = \int_0^{t_{long}} \left\langle \frac{\partial\theta}{\partial\lambda_a} \right\rangle \dot\lambda_a\, dt = \oint_C \left\langle \frac{\partial\theta}{\partial\lambda_a}\right\rangle d\lambda_a \qquad (4.170) $$

where we’ve used the fact that the change in λ_a is adiabatic to replace the integrand
with its average over one period of the orbit. The final expression above means an
integration over the curve C that the system traces in parameter space. We see that
Δθ is independent of the time t_long taken to make the change. However, it does depend
on the path that we took through the space of all possible potentials. It is known as
the “Hannay angle”. Note that the Hannay angle is invariant under the ambiguity
(4.168) even though the quantity ∂θ/∂λ_a that appears in the integrand isn’t. This
idea of integrating quantities around loops is an example of “holonomy”, an important
concept in modern physics.




Rather surprisingly, the Hannay angle was first discovered only in 1984. The history
is interesting. First Berry discovered a similar phase for the wavefunction in quantum
mechanics (now known as the Berry phase). Many physicists were shocked that such
a simple and profound idea as Berry’s phase had lain hidden in the formulation of
quantum mechanics for 50 years and it set off a flurry of theoretical and experimental
research. Soon after this, Hannay showed that an analogous phase had lain undiscovered
in classical mechanics for 150 years! Although, in certain examples in celestial
mechanics, the phase Δθ had been correctly calculated, the general theory lying behind
it had not been appreciated. We now describe this theory.


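The first step is to use a higher dimensional version of Stokes’ theorem to express
the contour integral (4.170) as a surface integral

$$ \Delta\theta = \int_S \left[ \frac{\partial}{\partial\lambda_a}\left\langle\frac{\partial\theta}{\partial\lambda_b}\right\rangle - \frac{\partial}{\partial\lambda_b}\left\langle\frac{\partial\theta}{\partial\lambda_a}\right\rangle \right] dA_{ab} \qquad (4.171) $$

where S is a surface in parameter space bounded by the curve C and dA_ab is the
infinitesimal surface element.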


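Claim: The Hannay angle can be written as

$$ \Delta\theta = \frac{d}{dI}\int_S W_{ab}\, dA_{ab} \qquad (4.172) $$

where the anti-symmetric matrix W_ab (known mathematically as a 2-form) is given by

$$ W_{ab} = \left\langle \frac{\partial\theta}{\partial\lambda_a}\frac{\partial I}{\partial\lambda_b} - \frac{\partial\theta}{\partial\lambda_b}\frac{\partial I}{\partial\lambda_a} \right\rangle \qquad (4.173) $$

Proof: To start with let’s think about the averaging procedure a little more. In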

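equation (4.157) we wrote ⟨A⟩ as a time average, integrating over a single period. We
could equally as well write it as an angle average,

$$ \langle A \rangle = \frac{1}{2\pi}\oint A(I,\theta)\, d\theta = \int A(q',p')\, \delta(I' - I)\, \frac{dq'dp'}{2\pi} \qquad (4.174) $$

where in the second equality we integrate over all of phase space and insert the delta
function δ(I′ − I) to restrict the integral to the orbit over the curve with action I. It’s
this latter formula that we’ll use. This allows us to write,

$$ \frac{\partial}{\partial\lambda_a}\left\langle\frac{\partial\theta}{\partial\lambda_b}\right\rangle - \frac{\partial}{\partial\lambda_b}\left\langle\frac{\partial\theta}{\partial\lambda_a}\right\rangle = \int \left[ \frac{\partial\theta}{\partial\lambda_b}\frac{\partial\delta}{\partial\lambda_a} - \frac{\partial\theta}{\partial\lambda_a}\frac{\partial\delta}{\partial\lambda_b} \right] \frac{dq'dp'}{2\pi} $$
$$ \qquad = \int \left[ \frac{\partial\theta}{\partial\lambda_b}\frac{\partial I'}{\partial\lambda_a} - \frac{\partial\theta}{\partial\lambda_a}\frac{\partial I'}{\partial\lambda_b} \right] \frac{\partial\delta}{\partial I'}\, \frac{dq'dp'}{2\pi} $$
$$ \qquad = -\frac{d}{dI}\int \left[ \frac{\partial\theta}{\partial\lambda_b}\frac{\partial I'}{\partial\lambda_a} - \frac{\partial\theta}{\partial\lambda_a}\frac{\partial I'}{\partial\lambda_b} \right] \delta(I' - I)\, \frac{dq'dp'}{2\pi} = \frac{d}{dI} W_{ab} \qquad (4.175) $$

which completes the proof. I haven’t included any examples here of the Hannay angle:
some simple ones can be found in the original literature⁹ and more advanced applications
can be found by googling “Hannay angles”. For the pendulum example, in which
the length of the pendulum varies, the Hannay angle vanishes. This is because there is
only one parameter to vary, while a non-trivial Δθ occurs only if we make a non-trivial
loop C in parameter space.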


4.7 The Hamilton-Jacobi Equation 


In this section we will describe yet another viewpoint on classical dynamics, known as
Hamilton-Jacobi theory. It will tie together several concepts that we’ve met so far.
Recall from section 2.1 the principle of least action. We define the action

$$ S = \int_0^T L(q_i, \dot{q}_i, t)\, dt \qquad (4.176) $$

which we evaluate for all paths q(t) with fixed end points

$$ q_i(0) = q_i^{initial}, \qquad q_i(T) = q_i^{final} \qquad (4.177) $$

Then the true path taken is an extremum of the action: δS = 0.

[Figure 68]


Now let’s change perspective a little. Consider the action evaluated only along the
true path $q_i^{classical}(t)$ and define

$$ W(q_i^{initial}, q_i^{final}, T) = S[q_i^{classical}(t)] \qquad (4.178) $$

While S is a functional on any path, W is to be considered as a function of the initial
and final configurations $q_i^{initial}$ and $q_i^{final}$ as well as the time T it takes to get between
them.

⁹J. Hannay, “Angle Variable Holonomy in Adiabatic Excursion of an Integrable Hamiltonian” J.
Phys A, 18 221 (1985); M. Berry, “Classical Adiabatic Angles and Quantal Adiabatic Phase” J. Phys
A, 18 15 (1985).

Now let’s ask what happens if we keep $q_i^{initial}$ fixed but vary the end point $q_i^{final}$. We
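can go back to the analysis of section 2.1 to see that when the action is varied it looks
like

$$ \delta S = \int_0^T dt \left[ \frac{\partial L}{\partial q_i} - \frac{d}{dt}\left(\frac{\partial L}{\partial\dot q_i}\right) \right]\delta q_i(t) + \left[ \frac{\partial L}{\partial\dot q_i}\,\delta q_i(t) \right]_0^T \qquad (4.179) $$

If we evaluate this on the classical path the first term vanishes. We’re left with

$$ \frac{\partial W}{\partial q_i^{final}} = \left.\frac{\partial L}{\partial\dot q_i}\right|_{t=T} = p_i^{final} \qquad (4.180) $$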

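The next thing we want to compute is ∂W/∂T. Let’s start by considering a classical
path with fixed initial configuration $q_i^{initial}$. We’ll let the path run on a little longer
than before, so T → T + δT. Then we have

$$ \frac{dW}{dT} = \frac{\partial W}{\partial T} + \frac{\partial W}{\partial q_i^{final}}\,\dot q_i^{final} = \frac{\partial W}{\partial T} + p_i^{final}\,\dot q_i^{final} \qquad (4.181) $$

But this total derivative is easily calculated since dS/dT = L, or

$$ \frac{dW}{dT} = L(q_i^{classical}(T), \dot q_i^{classical}(T), T) = L(q_i^{final}, \dot q_i^{final}, T) \qquad (4.182) $$

So we arrive at the equation,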
$$ \frac{\partial W}{\partial T} = -\left( p_i^{final}\,\dot q_i^{final} - L(q_i^{final}, \dot q_i^{final}, T) \right) = -H(q_i^{final}, p_i^{final}, T) \qquad (4.183) $$

At this stage, the only time in the game is T and the only position in the game is $q_i^{final}$.
So we can simply drop the word “final”, and relabel T → t. We have found ourselves
a time dependent function on configuration space W = W(q_i, t) which satisfies

$$ \frac{\partial W}{\partial q_i} = p_i \quad \text{and} \quad \frac{\partial W}{\partial t} = -H(q_i, p_i, t) \qquad (4.184) $$

or, substituting the first into the second, we have

$$ \frac{\partial W}{\partial t} = -H(q_i, \partial W/\partial q_i, t) \qquad (4.185) $$

This is the Hamilton-Jacobi Equation.
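
A free particle gives perhaps the simplest check of all this. The snippet below is an illustrative sketch only: it assumes H = p²/2m and uses the classical action of the straight-line path from q0 at time 0 to q at time t, W = m(q − q0)²/2t. The function names and the finite-difference step h are choices made here, not anything from the notes.

```python
def W(q, t, q0=0.0, m=1.0):
    """Classical action of a free particle travelling from q0 at time 0 to q at time t."""
    return 0.5 * m * (q - q0) ** 2 / t


def hj_residual(q, t, m=1.0, h=1e-5):
    """Return (dW/dt + H(q, dW/dq), dW/dq) for H = p^2/2m, using central differences."""
    dW_dq = (W(q + h, t, m=m) - W(q - h, t, m=m)) / (2 * h)
    dW_dt = (W(q, t + h, m=m) - W(q, t - h, m=m)) / (2 * h)
    return dW_dt + dW_dq ** 2 / (2 * m), dW_dq


residual, p = hj_residual(q=3.0, t=2.0, m=1.5)
print("Hamilton-Jacobi residual:", residual)
print("momentum dW/dq:", p, " expected m (q - q0) / t =", 1.5 * 3.0 / 2.0)
```

The residual vanishes up to finite-difference error, and ∂W/∂q reproduces the momentum m(q − q0)/t of the straight-line path, in line with (4.184) and (4.185).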


We’ve shown how a solution to the Hamilton-Jacobi equation can be constructed
by looking at the classical action of paths which reach a point q_i at time T starting
from some initial reference point $q_i^{initial}$. The starting point $q_i^{initial}$ can be considered
integration constants. In fact, there are more general solutions to the Hamilton-Jacobi
equation, although all are related to the classical action in a similar way.

Suppose we find a solution to (4.185). What do we do with it? We’re now armed
with some time-dependent function W(q_i, t) on configuration space. We combine this
with the first of Hamilton’s equations which reads

$$ \dot q_i = \left.\frac{\partial H}{\partial p_i}\right|_{p_i = \partial W/\partial q_i} \qquad (4.186) $$

where, on the right-hand-side, we’ve replaced every appearance of the momenta p_i by
a function of the coordinates using p_i = ∂W/∂q_i. What we’re left with is n first-order
differential equations for the evolution of q_i. In this manner the function W determines
the path of the classical system: start it off at a point in configuration space and W
can be considered as a real valued classical wavefunction which tells it how to evolve.
What we need to show is that the evolution dictated by (4.186) does indeed satisfy the
equations of motion. In other words, we should prove that the second of Hamilton’s
equations, $\dot p_i = -\partial H/\partial q_i$, is satisfied. We have
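$$ \dot p_i = \frac{d}{dt}\left(\frac{\partial W}{\partial q_i}\right) = \frac{\partial^2 W}{\partial q_i\,\partial q_j}\,\dot q_j + \frac{\partial^2 W}{\partial t\,\partial q_i} \qquad (4.187) $$

But differentiating the Hamilton-Jacobi equation (4.185) with respect to q_i, we see that
we can rewrite the right-hand-side of this equation using

$$ \frac{\partial^2 W}{\partial t\,\partial q_i} = -\frac{\partial H}{\partial q_i} - \frac{\partial H}{\partial p_j}\frac{\partial^2 W}{\partial q_j\,\partial q_i} = -\frac{\partial H}{\partial q_i} - \dot q_j\,\frac{\partial^2 W}{\partial q_i\,\partial q_j} \qquad (4.188) $$

So that (4.187) becomes $\dot p_i = -\partial H/\partial q_i$ as required.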


Let’s see what we’ve done. We’re used to dealing with second order differential equations
for the time evolution on configuration space (i.e. Lagrange’s equations) and
first order differential equations for time evolution on phase space (Hamilton’s equations).
But the Hamilton-Jacobi approach allows us to incorporate n of the integration
constants in the function W(q_i, t) so that we’re left solely with first order differential
equations on configuration space given by (4.186).


When we have conservation of the Hamiltonian, so ∂H/∂t = 0, there is a solution of
the Hamilton-Jacobi equation of a particularly simple form. Define

$$ W(q_i, t) = W^0(q_i) - Et \qquad (4.189) $$

for some constant E. Then the time dependence drops out and we get the equation

$$ H(q_i, \partial W^0/\partial q_i) = E \qquad (4.190) $$

W^0 is known as Hamilton’s characteristic function. The special property of this solution
to the Hamilton-Jacobi equation is that every path in configuration space determined by
the function W^0 has the same energy E. With a little thought, we can envisage how to
construct solutions to (4.190). Start with a co-dimension one surface in configuration
space which we will specify to be a surface of constant W^0. (Co-dimension one means that
the surface has dimension (n − 1): it splits the configuration space in two). At any point
in this surface, the potential energy V(q) is determined. Since p_i = ∂W^0/∂q_i, the
momentum is perpendicular to the surface and in the direction of increasing W^0. Its
magnitude is fixed by requiring that the total energy is E. But this magnitude then tells
us the position of the next surface of constant W^0 (with incremental increase). In this
manner, it should be clear that, in multi-dimensional configuration spaces there are many
solutions to (4.190). It should also be clear that something singular happens to W^0 in
regions where V(q_i) = E, since there the momentum vanishes. Finally, we note that
even when ∂H/∂t = 0, there exist other solutions W to (4.185) which encode families
of trajectories on configuration space which have different energies.

[Figure 69]


4.7.1 Action and Angles from Hamilton-Jacobi 


For the majority of examples the Hamilton-Jacobi approach doesn’t give a particularly
useful way for solving a problem; its utility really lies in the structure it reveals about
classical dynamics. So rather than go through the gymnastics of solving a complicated
problem using this method, let us focus on a rather simple example which illustrates
connections between the different ideas we’ve seen. A system with a single
degree of freedom has Hamiltonian

$$ H = \frac{p^2}{2m} + V(q) \qquad (4.191) $$


Since the Hamiltonian is time independent, energy is conserved and the solution to
the Hamilton-Jacobi equation has a single integration constant, let’s call it β, which
is necessarily some function of the energy. In the above discussion we were a little lax
about showing these integration constants explicitly, but let’s do it now: we’ll write
W = W(q, t; β) with β = β(E). Now we ask a somewhat strange question: suppose we
try to perform a canonical transformation from (q, p) to new coordinates (α, β) such
that β is the new momentum. What is the new coordinate α?

Since we wish the change of coordinates to be canonical, we must be able to write
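q = q(α, β) and p = p(α, β) such that

$$ \{q, p\}_{(\alpha,\beta)} \equiv \frac{\partial q}{\partial\alpha}\frac{\partial p}{\partial\beta} - \frac{\partial q}{\partial\beta}\frac{\partial p}{\partial\alpha} = 1 \qquad (4.192) $$

Using p = ∂W/∂q and remembering what all depends on what all (W = W(q, β) and
q = q(α, β) and p = p(α, β)) we can write this as,

$$ \{q, p\}_{(\alpha,\beta)} = \frac{\partial q}{\partial\alpha}\left( \frac{\partial^2 W}{\partial\beta\,\partial q} + \frac{\partial^2 W}{\partial q^2}\frac{\partial q}{\partial\beta} \right) - \frac{\partial q}{\partial\beta}\frac{\partial^2 W}{\partial q^2}\frac{\partial q}{\partial\alpha} = \frac{\partial q}{\partial\alpha}\frac{\partial}{\partial q}\left(\frac{\partial W}{\partial\beta}\right) \qquad (4.193) $$

and we find that the transformation is canonical if we take α = ∂W/∂β. Note the nice
symmetry here: we have a solution W(q, t; β) to the Hamilton-Jacobi equation and we
can think in terms of canonical coordinates (q, p) or alternatively (α, β) where

$$ p = \frac{\partial W}{\partial q}, \qquad \alpha = \frac{\partial W}{\partial\beta} \qquad (4.194) $$

The function W is an example of a generating function of the second kind (4.120).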


So what to do with this? Let’s look at some examples. Take β = E, so that our new
momentum variable is the energy itself. What is the canonical coordinate? If we write
W(q, t; E) = W^0(q, E) − Et then the coordinate canonically dual to E is

$$ \alpha = \frac{\partial W^0}{\partial E}(q, E) - t \qquad (4.195) $$

Taking the time dependence over to the left-hand-side, we see that α has the interpretation
of −t_0, the initial starting time. This tells us that we may parameterise every trajectory
in a one-dimensional system in terms of the energy and starting time of the path, and
that these are canonical variables. Again we see the dual relationship between energy
and time. Note that both E and t_0 are independent of time; we’ve found canonical
variables for which neither the coordinate nor the momentum vary along the path.
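
To see (4.195) at work, here is a short worked example for the harmonic oscillator. It is a sketch under the assumption V(q) = ½mω²q², with W^0 = ∫p dq taken from (4.190); it is offered as an illustration rather than as part of the argument above.

```latex
W^0(q, E) = \int \sqrt{2m\left(E - \tfrac{1}{2} m\omega^2 q^2\right)}\; dq
\quad\Longrightarrow\quad
\frac{\partial W^0}{\partial E}
  = \int \frac{m\, dq}{\sqrt{2m\left(E - \tfrac{1}{2} m\omega^2 q^2\right)}}
  = \frac{1}{\omega}\arcsin\!\left( q\,\sqrt{\frac{m\omega^2}{2E}} \right)

\alpha = \frac{\partial W^0}{\partial E} - t = -t_0
\quad\Longrightarrow\quad
q(t) = \sqrt{\frac{2E}{m\omega^2}}\;\sin\omega (t - t_0)
```

So holding α fixed simply reproduces the familiar oscillator solution, with E setting the amplitude and t_0 the starting time.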


As another example consider the case of β = I, our action variable of Section 4.5.
What is the canonical coordinate α in this case? We expect that it will be related to
the angle variable θ. To see this, we use the fact that W is the classical action to write

$$ W^0 = \int L\, dt + Et = \int (L + H)\, dt = \int \dot q\, p\, dt = \int p\, dq \qquad (4.196) $$

So we have that

$$ \alpha = \frac{\partial W}{\partial\beta} = \frac{d}{dI}\int p\, dq - \frac{dE}{dI}\, t = \theta - \omega t \qquad (4.197) $$

where we’ve used our expression (4.144) for the angle variable, as well as the equation
(4.141) for the frequency of motion ω. So we see that α is not quite equal to θ,
but is shifted by a term linear in time. In fact this means that α itself does not
change in time. Once again, we’ve arrived at a way to parameterise the orbits of
motion by canonical variables which do not themselves change with time. In fact, in
most presentations, this is the starting motivation for the Hamilton-Jacobi approach
to classical dynamics and, even for higher dimensional systems, the function W can
be thought of as a way to generate new, time independent, canonical variables. More
discussion on the relationship between canonical transformations, angle-action variables
and the Hamilton-Jacobi formulation can be found in the book by Hand and Finch, or
deep within Goldstein.


4.8 Quantum Mechanics 


One of the primary reasons for studying the rather formal aspects of classical mechanics 
discussed in this course is to make contact with quantum mechanics. For this reason, 
in this last section of the course we will illustrate the connection between the classical 
and quantum world and point out a few analogies that might just make the quantum 
behaviour look a little less weird. (Just a little less: after all, it really is weird!) 


In classical mechanics the state of a system is described by a point (q_i, p_i) in phase
space. In quantum mechanics the state is described by a very different object: a
complex valued wavefunction ψ(q) over the configuration space. The observables are
operators on the space of wavefunctions. The standard representations of the position
operator $\hat q_i$ and momentum operator $\hat p_i$ are

$$ \hat q_i\,\psi(q) = q_i\,\psi(q), \qquad \hat p_i\,\psi(q) = -i\hbar\,\frac{\partial\psi}{\partial q_i} \qquad (4.198) $$

which leads to the well known Heisenberg commutation relations

$$ [\hat q_i, \hat q_j] = 0, \qquad [\hat p_i, \hat p_j] = 0, \qquad [\hat q_i, \hat p_j] = i\hbar\,\delta_{ij} \qquad (4.199) $$

Of course, we’ve already seen something very familiar in section 4.3 on Poisson brackets
as summarised in equation (4.61). Although the bilinear, antisymmetric operators
[ , ] and { , } act on very different spaces, they carry the same algebraic structure.
Heuristically the relations (4.61) and (4.199) both represent the mathematical fact
that momentum p_i generates infinitesimal translations of q_i: in classical mechanics we
saw this when we studied infinitesimal canonical transformations in section 4.4.1; in
quantum mechanics it follows from the representation (4.198) and Taylor’s expansion.


In general the map between a classical system and a quantum system goes via the
Poisson brackets and was formulated by Dirac:

$$ \{\ ,\ \}_{classical} \;\leftrightarrow\; -\frac{i}{\hbar}\,[\ ,\ ]_{quantum} \qquad (4.200) $$

This prescription for going between the classical and quantum theories is known as
canonical quantisation. It also gives rise to the quantum equations of motion. In the
Poisson bracket language, we have seen that the classical equation of motion for an
arbitrary function f(q, p) is

$$ \dot f = \{f, H\} \quad\longrightarrow\quad i\hbar\,\dot{\hat f} = [\hat f, \hat H] \qquad (4.201) $$

which is the equation of motion in the Heisenberg picture, in which the time dependence
is assigned to the operator rather than the state.


While a great physicist, Dirac was never much of a storyteller. It shows in the 
following anecdote recounting his graduate student days: 


“I went back to Cambridge at the beginning of October 1925, and resumed 
my previous style of life, intense thinking about these problems during the 
week and relaxing on Sunday, going for a long walk in the country alone. 
The main purpose of these long walks was to have a rest so that I would 
start refreshed on the following Monday. 


It was during one of the Sunday walks in October 1925, when I was think- 
ing about this (uv — vu), in spite of my intention to relax, that I thought 
about Poisson brackets. I remembered something which I had read up previ- 
ously, and from what I could remember, there seemed to be a close similarity 
between a Poisson bracket of two quantities and the commutator. The idea 
came in a flash, I suppose, and provided of course some excitement, and 
then came the reaction “No, this is probably wrong”. 


I did not remember very well the precise formula for a Poisson bracket,
and only had some vague recollections. But there were exciting possibilities
there, and I thought that I might be getting to some big idea. It was really
a very disturbing situation, and it became imperative for me to brush up on
my knowledge of Poisson brackets. Of course, I could not do that when I
was right out in the countryside. I just had to hurry home and see what I
could find about Poisson brackets.


I looked through my lecture notes, the notes that I had taken at various 
lectures, and there was no reference there anywhere to Poisson brackets. 
The textbooks which I had at home were all too elementary to mention them. 
There was nothing I could do, because it was Sunday evening then and the 
libraries were all closed. I just had to wait impatiently through that night 
without knowing whether this idea was really any good or not, but I still 
think that my confidence gradually grew during the course of the night. 


The next morning I hurried along to one of the libraries as soon as it
was open, and then I looked up Poisson brackets in Whittaker’s Analytical
Dynamics, and I found that they were just what I needed.”


4.8.1 Hamilton, Jacobi, Schrödinger and Feynman


While the Poisson bracket structure of quantum mechanics dovetails nicely with Heisenberg’s
approach, the Hamilton-Jacobi equation is closely tied to Schrödinger’s wave
equation. Let’s first recall what Schrödinger’s equation looks like for a one-dimensional
system with a Hamiltonian operator $\hat H = \hat p^2/2m + V(\hat q)$ acting on wavefunctions ψ(q),

$$ i\hbar\,\frac{\partial\psi}{\partial t} = \hat H\psi = -\frac{\hbar^2}{2m}\frac{\partial^2\psi}{\partial q^2} + V(q)\,\psi \qquad (4.202) $$

where we have used the representation (4.198) for the position and momentum operators.
To see the relationship of this equation to classical dynamics we decompose the
wavefunction into the modulus and phase,

$$ \psi(q,t) = R(q,t)\, e^{iW(q,t)/\hbar} \qquad (4.203) $$

where R and W are both real functions. We know that R is related to the probability
P for finding a particle at position q at time t: P(q,t) = |ψ(q,t)|² = R(q,t)². But what
is the interpretation of the phase W? Let’s substitute this decomposition of ψ into the
Schrödinger equation to find

$$ i\hbar\left[ \frac{\partial R}{\partial t} + \frac{i}{\hbar}\, R\,\frac{\partial W}{\partial t} \right] = -\frac{\hbar^2}{2m}\left[ \frac{\partial^2 R}{\partial q^2} + \frac{2i}{\hbar}\frac{\partial R}{\partial q}\frac{\partial W}{\partial q} - \frac{R}{\hbar^2}\left(\frac{\partial W}{\partial q}\right)^2 + \frac{i}{\hbar}\, R\,\frac{\partial^2 W}{\partial q^2} \right] + V R $$

At this stage, we take the classical limit ħ → 0. Or, more precisely, we consider a
situation with

$$ \hbar\left| \frac{\partial^2 W}{\partial q^2} \right| \ll \left| \frac{\partial W}{\partial q} \right|^2 \qquad (4.204) $$

which can be understood physically as the requirement that the de Broglie wavelength
of the particle is much smaller than any other length scale around. Either way, collecting
together the terms above to leading order in ħ we find

$$ \frac{\partial W}{\partial t} + \frac{1}{2m}\left(\frac{\partial W}{\partial q}\right)^2 + V(q) = 0 \qquad (4.205) $$

which we recognise as the Hamilton-Jacobi equation (4.185). So in the classical limit
the phase of the wavefunction is understood as the classical action of the path taken
by the particle.


Finally, let us finish on the same topic that we started: the principle of least action.
Recall from section 2.1 that we can determine the true path of a system by assigning a
number, called the action S, to every possible path. The equations of motion are then
equivalent to insisting that the true path is an extremum of S. But what about all the
other paths? Do they play any role in Nature? The answer is that, in the quantum
world, they do. Suppose a particle is observed to be at position q_i at time t = 0. Then
the probability P that it will later be observed to be at position q_f at time t = T is
encapsulated in the wavefunction ψ(q_f, T). The Feynman path integral formula for the
wavefunction is

$$ \psi(q_f, T) = N\int_{q_i}^{q_f} \mathcal{D}q(t)\; e^{iS[q(t)]/\hbar} \qquad (4.206) $$

The N here is just a normalisation constant to ensure that probabilities add up to one:
i.e. ∫|ψ(q)|² dq = 1. The tricky part of this formula is the integral: it is a sum over all
possible paths from q = q_i at time 0 to q = q_f at time T. These paths are weighted
with their action. It’s as if the particle really does take every possible path, but with
a particular phase. In the limit ħ → 0, the phases oscillate wildly for any path away
from the classical equation of motion δS = 0 and they cancel out in the integral. But
for situations where ħ is important, the other paths are also important.


Let’s prove that the wavefunction defined by (4.206) satisfies the Schrödinger equation.
Firstly we need to understand this integral over paths a little better. We do this
by splitting the path into n small segments, each ranging over a small time δt = T/n.

Figure 70: Discretising the paths.

Then we define

$$ \int \mathcal{D}q(t) = \lim_{n\to\infty}\, \prod_{k=1}^{n} \int_{-\infty}^{+\infty} \frac{dq_k}{C} \qquad (4.207) $$

where q_k is the position of the particle at time t = kδt. In this expression C is a
constant that we’re going to figure out shortly that will be required to make sense of
this infinite number of integrals. In any given segment, we treat the path as straight
lines as shown in the figure and replace the action with the appropriate quantity,

$$ S = \int_0^T dt \left( \frac{1}{2} m\dot q^2 - V(q) \right) \;\longrightarrow\; \sum_{k=1}^{n} \left( \frac{m}{2}\,\frac{(q_{k+1} - q_k)^2}{\delta t} - \delta t\; V\!\left(\frac{q_{k+1} + q_k}{2}\right) \right) $$

Then to prove that ψ defined in (4.206) satisfies Schrödinger’s equation, let’s consider
adding a single extra time step to look at the wavefunction at time t + dt. We can 
Taylor expand the left hand side of (4.206) happily 


o 
war, T + ôt) = Yla, T) + w ôt + O(t’) (4.208) 
while the right hand side requires us to do one extra integral over the penultimate posi- 
tion of the path q’. But the first n integrals simply give back the original wavefunction, 
now evaluated at q’. We get 


+o dg im (q= q) 4 ap +q f 
a g P É s: av ( 5 )] w(q',t) (4.209) 


The term in the exponent means that the integral oscillates wildly whenever $q'$ is far 
from $q_f$ and these regions of the integral will all cancel out. We can therefore Taylor 
expand around $(q_f - q')$ to rewrite this as 


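$$\int_{-\infty}^{+\infty} \frac{dq'}{C} \exp\left[ \frac{im}{2\hbar} \frac{(q_f - q')^2}{\delta t} \right] \left[ 1 - \frac{i\, \delta t}{\hbar} V(q_f) + \ldots \right] \left[ 1 + (q' - q_f) \frac{\partial}{\partial q_f} + \frac{1}{2} (q' - q_f)^2 \frac{\partial^2}{\partial q_f^2} + \ldots \right] \psi(q_f, T) \tag{4.210}$$

At this stage we do the integral over $q'$. We’ll use the formulae for Gaussian integration 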


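$$\int dy\, e^{-ay^2} = \sqrt{\frac{\pi}{a}}\ ; \qquad \int dy\, y\, e^{-ay^2} = 0\ ; \qquad \int dy\, y^2 e^{-ay^2} = \frac{1}{2a} \sqrt{\frac{\pi}{a}} \tag{4.211}$$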
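
These formulae can be checked numerically. The sketch below does so in Python for a real, positive value of a, which is an assumption made for convenience: in the calculation here a is imaginary and the same formulae are used by analytic continuation. The helper gaussian_moment, the integration cut-off and the value of a are illustrative.

```python
import math

def gaussian_moment(power, a, half_width=12.0, steps=20000):
    """Midpoint-rule estimate of the integral of y**power * exp(-a*y*y) over the real line.

    The integrand is negligible beyond |y| = half_width for the value of a used below.
    """
    dy = 2 * half_width / steps
    total = 0.0
    for j in range(steps):
        y = -half_width + (j + 0.5) * dy
        total += y ** power * math.exp(-a * y * y)
    return total * dy

a = 0.7
m0 = gaussian_moment(0, a)   # compare with sqrt(pi/a)
m1 = gaussian_moment(1, a)   # compare with 0
m2 = gaussian_moment(2, a)   # compare with (1/2a) sqrt(pi/a)
print(m0, m1, m2)
```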


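Then equating the left-hand side (4.208) with the right-hand side (4.210), we have 

$$\psi(q_f, T) + \frac{\partial \psi}{\partial T}\, \delta t = \frac{1}{C} \sqrt{\frac{2\pi\hbar\, \delta t}{-im}} \left[ 1 - \frac{i\, \delta t}{\hbar} V(q_f) + \frac{i\hbar\, \delta t}{2m} \frac{\partial^2}{\partial q_f^2} + O(\delta t^2) \right] \psi(q_f, T)$$

At this stage we see what the constant $C$ has to be to make sense of this whole calculation: 
we should take 

$$C = \sqrt{\frac{2\pi\hbar\, \delta t}{-im}} \tag{4.212}$$

so that $C \to 0$ as $\delta t \to 0$. Then the terms of order $O(\delta t^0)$ agree. Collecting the terms 
of order $O(\delta t)$, and replacing the time $T$ at the end point with the general time $t$, we 
see that we have 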

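$$i\hbar \frac{\partial \psi}{\partial t} = -\frac{\hbar^2}{2m} \frac{\partial^2 \psi}{\partial q_f^2} + V(q_f)\, \psi = \hat{H} \psi \tag{4.213}$$


and we recover Schrödinger’s equation as promised. 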
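
The bookkeeping of factors of i in this last step is easy to get wrong, so here is a small arithmetic check in Python. It assumes that the Gaussian formulae are applied with a = -im/(2 hbar dt), read off from the exponent, and that C is chosen to cancel sqrt(pi/a), i.e. C = sqrt(2 pi hbar dt / (-im)). The numerical values of hbar, m and dt are arbitrary illustrative choices.

```python
import cmath

hbar, m, dt = 1.0, 2.0, 1e-3          # arbitrary illustrative values

a = -1j * m / (2 * hbar * dt)         # Gaussian width read off from the exponent
C = cmath.sqrt(2 * cmath.pi * hbar * dt / (-1j * m))

# Order dt^0: the plain Gaussian integral divided by C should be 1.
zeroth = cmath.sqrt(cmath.pi / a) / C

# Order dt: the (1/2)(q' - q_f)^2 term gives (1/2)(1/2a) times the zeroth-order
# integral, which should equal i*hbar*dt/(2m), the coefficient of the second derivative.
second = 0.5 / (2 * a)
expected = 1j * hbar * dt / (2 * m)
print(zeroth, second, expected)
```

Multiplying the order dt terms through by i hbar then turns the coefficient i hbar/2m into the familiar -hbar^2/2m of the kinetic term.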


4.8.2 Nambu Brackets 


Throughout this section, we’ve seen that several of the structures appearing in quantum 
mechanics were anticipated, in some form, within the framework of classical dynamics. 
You just need to know where to look. One might wonder whether classical mechanics 
also contains other structures which will prove to be important in future discoveries. 
Or, alternatively, whether there are ways to extend the framework of classical dynamics 
that hint at new ways to formulate the laws of physics. In this section, I’ll briefly 
describe such an extension due to Nambu in 1973. I should confess immediately that 
there’s no known use for Nambu’s formalism! And it’s not at all clear that one will be 
found! But then again, maybe it holds the clue that will prove crucial in the search for 
the ideas beyond the known laws of Nature. 


We’ve seen that the Hamiltonian framework deals with canonical pairs of coordinates 
and momenta $(q_i, p_i)$ with $i = 1, \ldots, n$. Nambu’s idea was to extend this to triplets of 
objects $(q_i, p_i, r_i)$ with $i = 1, \ldots, n$. We don’t say what this extra variable $r_i$ is: just 
that it is necessary to define the state of a system. This means that the phase space 
has dimension $3n$. The Nambu bracket, which replaces the Poisson bracket, acts on 
three functions $f$, $g$ and $h$ in phase space, 
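$$\begin{aligned} \{f, g, h\} = \sum_i \Bigg( & \frac{\partial f}{\partial q_i} \frac{\partial g}{\partial p_i} \frac{\partial h}{\partial r_i} - \frac{\partial f}{\partial q_i} \frac{\partial g}{\partial r_i} \frac{\partial h}{\partial p_i} + \frac{\partial f}{\partial r_i} \frac{\partial g}{\partial q_i} \frac{\partial h}{\partial p_i} \\ & - \frac{\partial f}{\partial r_i} \frac{\partial g}{\partial p_i} \frac{\partial h}{\partial q_i} + \frac{\partial f}{\partial p_i} \frac{\partial g}{\partial r_i} \frac{\partial h}{\partial q_i} - \frac{\partial f}{\partial p_i} \frac{\partial g}{\partial q_i} \frac{\partial h}{\partial r_i} \Bigg) \end{aligned} \tag{4.214}$$


This satisfies similar properties to the Poisson bracket, including linearity and 
• Anti-symmetry: $\{f, g, h\} = -\{g, f, h\} = \{g, h, f\}$. 
• Leibniz: $\{fg, h, l\} = f\{g, h, l\} + \{f, h, l\}g$. 
• “Jacobi”: $\{\{f, g, h\}, l, m\} + \{h, \{f, g, l\}, m\} + \{h, l, \{f, g, m\}\} = \{f, g, \{h, l, m\}\}$. 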
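
For a single triplet (n = 1) the Nambu bracket is just the determinant of the Jacobian of (f, g, h) with respect to (q, p, r), which makes the properties above easy to test numerically. The Python sketch below does this using central finite differences. The helper names partial and nambu, the test functions and the sample point are all hypothetical choices for illustration.

```python
def partial(fn, x, idx, eps=1e-5):
    """Central-difference partial derivative of fn at the point x along coordinate idx."""
    up, dn = list(x), list(x)
    up[idx] += eps
    dn[idx] -= eps
    return (fn(*up) - fn(*dn)) / (2 * eps)

def nambu(f, g, h, x):
    """Nambu bracket {f, g, h} at x = (q, p, r) for n = 1, as a 3x3 determinant."""
    (fq, fp, fr), (gq, gp, gr), (hq, hp, hr) = (
        [partial(fn, x, k) for k in range(3)] for fn in (f, g, h)
    )
    return (fq * (gp * hr - gr * hp)
            - fp * (gq * hr - gr * hq)
            + fr * (gq * hp - gp * hq))

# Hypothetical test functions on the three-dimensional phase space.
f = lambda q, p, r: q * p
g = lambda q, p, r: p + r * r
h = lambda q, p, r: q * r + p
ell = lambda q, p, r: q + p * r
fg = lambda q, p, r: f(q, p, r) * g(q, p, r)

x = (0.3, -1.2, 0.8)

fgh = nambu(f, g, h, x)
gfh = nambu(g, f, h, x)          # swapping two entries should flip the sign
ghf = nambu(g, h, f, x)          # a cyclic permutation should leave it unchanged
leibniz_lhs = nambu(fg, h, ell, x)
leibniz_rhs = f(*x) * nambu(g, h, ell, x) + nambu(f, h, ell, x) * g(*x)
print(fgh, gfh, ghf, leibniz_lhs, leibniz_rhs)
```

The finite differences are only approximate, so the identities hold up to a small numerical error rather than exactly.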


In order to specify time evolution, we need two “Hamiltonians”. We call them $H(q, p, r)$ 
and $G(q, p, r)$. Then any function $f$ over phase space evolves as 

$$\frac{df}{dt} = \{f, G, H\} \tag{4.215}$$

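In particular, the new version of Hamilton’s equations reads 

$$\frac{dq_i}{dt} = \frac{\partial G}{\partial p_i} \frac{\partial H}{\partial r_i} - \frac{\partial G}{\partial r_i} \frac{\partial H}{\partial p_i}\ , \qquad \frac{dp_i}{dt} = \frac{\partial G}{\partial r_i} \frac{\partial H}{\partial q_i} - \frac{\partial G}{\partial q_i} \frac{\partial H}{\partial r_i}\ , \qquad \frac{dr_i}{dt} = \frac{\partial G}{\partial q_i} \frac{\partial H}{\partial p_i} - \frac{\partial G}{\partial p_i} \frac{\partial H}{\partial q_i} \tag{4.216}$$

where there’s no sum over $i$ on the right-hand side of these equations. By the anti-symmetry 
of the Nambu bracket, we learn that both $H$ and $G$ are conserved (as long 
as neither has explicit time dependence). 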
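
To see the conservation of H and G at work, here is a minimal Python sketch that integrates the equations of motion for a single triplet (n = 1). In that case df/dt = {f, G, H} applied to f = q, p, r says that the velocity (dq/dt, dp/dt, dr/dt) is the cross product of the gradients of G and H. The particular quadratic G and H, the Runge-Kutta integrator, the step size and the initial condition are all assumptions chosen for illustration.

```python
def nambu_flow(x, grad_G, grad_H):
    """(dq/dt, dp/dt, dr/dt) = grad G x grad H for a single triplet (q, p, r)."""
    Gq, Gp, Gr = grad_G(x)
    Hq, Hp, Hr = grad_H(x)
    return (Gp * Hr - Gr * Hp, Gr * Hq - Gq * Hr, Gq * Hp - Gp * Hq)

def rk4_step(x, dt, grad_G, grad_H):
    """One classical fourth-order Runge-Kutta step of the Nambu flow."""
    def shifted(k, scale):
        return tuple(xi + scale * ki for xi, ki in zip(x, k))
    k1 = nambu_flow(x, grad_G, grad_H)
    k2 = nambu_flow(shifted(k1, dt / 2), grad_G, grad_H)
    k3 = nambu_flow(shifted(k2, dt / 2), grad_G, grad_H)
    k4 = nambu_flow(shifted(k3, dt), grad_G, grad_H)
    return tuple(xi + dt / 6 * (a + 2 * b + 2 * c + d)
                 for xi, a, b, c, d in zip(x, k1, k2, k3, k4))

# Hypothetical choice of the two "Hamiltonians" and their gradients.
G = lambda x: 0.5 * (x[0] ** 2 + x[1] ** 2 + x[2] ** 2)
H = lambda x: 0.5 * (x[0] ** 2 + 2 * x[1] ** 2 + 3 * x[2] ** 2)
grad_G = lambda x: (x[0], x[1], x[2])
grad_H = lambda x: (x[0], 2 * x[1], 3 * x[2])

x0 = (1.0, 0.5, -0.3)
x = x0
for _ in range(5000):                 # integrate up to t = 5
    x = rk4_step(x, 1e-3, grad_G, grad_H)

drift_G = abs(G(x) - G(x0))
drift_H = abs(H(x) - H(x0))
print(x, drift_G, drift_H)
```

The state moves around phase space while G and H stay fixed up to the integration error, so the trajectory lies on the intersection of a level surface of G with a level surface of H.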


Many of the key features of classical dynamics are retained in Nambu’s formalism. 
For example, Liouville’s theorem still holds. (This was Nambu’s original motivation 
for suggesting this framework). Similarly, canonical transformations can be defined as 
a change of variables $q_i \to Q_i(q, p, r)$ and $p_i \to P_i(q, p, r)$ and $r_i \to R_i(q, p, r)$ such that 
the Nambu bracket structure is preserved, for example 

$$\{Q_i, P_j, R_k\} = \begin{cases} 1 & \text{if } i = j = k \\ 0 & \text{otherwise} \end{cases} \tag{4.217}$$


together with similar equations involving other combinations of Q’s, P’s and R’s. 


“Hamilton’s” equations (4.216) are invariant under these canonical transformations. 
The Nambu bracket provides a generalisation of classical dynamics. But can we 
quantise it? In other words, can we find some operators which reproduce the Nambu 
bracket structure (up to a factor of $\hbar$ and perhaps an $i$) in much the same way that the 
usual quantum theory ties in with the Poisson bracket? This turns out to be pretty 
tricky. In particular, it seems difficult to keep all three conditions: anti-symmetry, 
Leibniz and Jacobi. Perhaps this suggests that the correct mathematical structure has 
not yet been uncovered. Perhaps it suggests that the Nambu bracket is just not useful! 


Chances are that you won’t ever have any use for the Nambu bracket. But you never 
know. Perhaps one day, like Dirac, you’ll return from wandering the fields around 
Cambridge and desperately need to recall this concept. But, unlike Dirac, you’ll be 
able to find a reference in your lecture notes (the notes you had taken at various 
lectures). 